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Abstract 



Primordial Fluctuations in String Cosmology 



This thesis is dedicated to the study of inflationary scenarios based on string theory. Inflation is a 
brief period of accelerated expansion in the very early Universe which gives natural explanations 
for problems of the Standard Big Bang Model (SBBM) of cosmology. The phenomenological 
success of inflation provides the motivation for numerous efforts to establish its connection with 
particle physics. As a promising contender for a unified theory at very high energy scales, 
string theory is an obvious framework to look for the theoretical foundations of inflation; we 
therefore try to determine the characteristic features of string-inspired inflationary scenarios, 
along with their observational consequences. In particular, we shall be interested in models of 
brane inflation, in which the inflaton fleld has non-canonical dymanics and where the era of 
reheating is interpreted as the mutual annihilation of (anti-)branes. 

We present first a detailed comparison of the "KKLMMT brane infiation" scenario to data from 
the Wilkinson Microwave Anisotropy Probe (WMAP) satellite. We then turn our attention 
to the consequences of modified dynamics for the inflaton: outside the string theory context, 
these scenarios are known as fe-inflation, and we establish their observational predictions with 
particular regard for the differences to the standard case (where the inflaton kinetic term is 
canonical). We study in detail the case of "Dirac Born Infcld (DBI) inflation" , which is the string- 
inspired subclass of /^-inflationary models. In a second step, we then again compare the obtained 
predictions to WMAP. Two other publications consider the end of brane inflation: firstly, we 
calculate the behaviour of entropy perturbations at the onset of brane-anti-brane annihilation in 
the KKLMMT model and, secondly, we study reheating of the Universe following multiple brane 
collisions in a novel scenario called "monodromy inflation". Keywords: cosmology, inflation, 

perturbations, cosmic microwave background radiation, string theory, branes 
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Introduction 



The scenario of inflation has shaped the subject of cosmology ever since its invention over 25 
years ago. Originally intended to dispose of some of the Standard Big Bang Model's (SBBM) 
shortcomings, it was soon realized that inflation may be responsible for a powerful manifesta- 
tion of our Universe's quantum nature: assuming the very early Universe was dominated by 
a scalar field (the so-called inflaton), the origin of today's cosmic structures such as galaxies, 
clusters, and filaments can be traced back to this field's quantum fluctuations. Stretched by the 
quasi-exponential expansion of spacetime, these fluctuations served as the primordial seeds for 
inhomogeneities that later grew under the influence of gravitational instability. 

As a discipline of physics, cosmology is as much interested in explaining how a phenomenon 
occurs (at the technical level, i.e. through its formulation in mathematical terms) as it tries to 

understand why it takes place. Technically, the inflationary scenario is appealing because of 
its simple and economic nature. However, its origin from an underlying physical theory has 
remained elusive because the (scalar) inflaton is difficult to explain from the field content of the 
Standard Model (SM) of particle physics. 

On the other hand, given that inflation possibly took place while the Universe's energy density 
was beyond the SM's predictive regime, we may have a better chance of finding the inflaton 
among the degrees of freedom of a Grand Unified Theory (GUT). In terms of both the effort 
invested and the progress achieved, string theory clearly occupies a privileged position among 
GUT candidates. Moreover, it is particularly rich in scalar fields, whose stringy interpretation 
ranges from coupling constants to the geometrical detail of extra dimensions. Prom a cosmolog- 
ical viewpoint, it hence seems natural to closely examine these string-inspired scalar fields with 
respect to their "inflationary" use. In recent years, this has been met with a warm welcome on 
the side of string theory. 

While one might not appraise string theory for its mathematical simplicity, the attempt to con- 
struct a theory that has but one parameter (namely the length of fundamental strings is) has 
proven to be very fruitful. However, the lack of string theoretic predictions verifiable in accel- 
erator experiments has been felt more and more clearly over the years. Therefore cosmology, 
as a different arena (besides particle physics) for string theory's concepts to surface, is of ut- 
most interest. It is the purpose of this thesis to walk the line between these two subjects, the 
confrontation of which has already produced a wealth of literature. 

In Part I, we lay the cosmological ground work for our analysis, introducing the concepts of 
modern cosmology including the scenario of inflation. We discuss the treatment of perturbations 
in inflation as well as their relation to today's observable quantities, explaining how to pass 
from primordial perturbation spectra through the phases of reheating and SBBM evolution 
to measurements of cosmic microwave background (CMB) anisotropies. The global picture 
(ie. beyond our own Hubble horizon) of the inflationary Universe is considered in the context 
of "stochastic inflation". 

Open issues with inflation and the need for proper theoretical anchorage in high energy physics 
lead us to an in-depth discussion of string cosmology in Part II: following a self-contained intro- 
duction to string theory, we discuss the progress and pitfalls on the way to viable string-inspired 
models of the early Universe. We give a short overview of the different types of string inflation- 
ary models constructed, with a clear focus on scenarios of brane inflation, which are behind the 
original scientific results presented in this thesis. In particular, we derive the archetype of these 
models, the so-called KKLMMT scenario, from its foundation in type IIB superstring theory. 
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Part III assembles the new scientific results obtained over the course of this thesis. Firstly, the 
KKLMMT scenario of brane-anti-brane inflation was compared with the WMAP3 data using 
the Monte Carlo Markov Chain approach. A thorough derivation of the model's cosmological 
parameters from the type IIB string theory background was applied when restricting their a pri- 
ori exploration range for consistency, and as well when interpreting the a posteriori probability 
distributions. These results are presented in the article included in Chapter 12. 
In the following Chapter, we take a closer look at the modified dynamics of the infiaton field 
if it is an open string mode, as is the case for branc inflation. Traditionally called A;-inflation, 
scenarios with a non-canonical term in string theory are of the Dirac Born Infeld (DEI) type. 
The calculation of perturbation spectra must be adapted to this new situation because the A;- 
infiationary perturbations propagate at a non-trivial sound speed Cg . We derive the scalar and 
tensor spectra in the fc-inflation analogue of the slow roll approximation (where changes in c^, 
on top of those in the Hubble parameter H, must be small), using the so-called uniform approx- 
imation. Also presented in Chapter 13 is a comparison of these ^-inflationary power spectra 
(and their DBI subclass) to the WMAP5 data. 

When the brane and the anti-branc start their annihilation at the end of inflation in the 
KKLMMT scenario, a tachyon appears and, for a short period of time, the evolution can be 
described in terms of a two-field scenario with a new, adjusted potential. This two-field phase 
(where both infiaton and tachyon are dynamic) is the subject of the publication in Chapter 14. 
It is found that entropy perturbations between the two fields can grow exponentially during a 
brief time interval, and they may accumulate enough to induce a sizable contribution to the 
comoving curvature perturbation (which is due to purely adiabatic perturbations in single field 
scenarios) . 

Reheating is also considered in Chapter 15, but this time in the context of a different brane 
infiation model: in type IIA string theory, the infiaton field can be associated with the "wrap- 
ping" of a D4-brane along a direction of monodromy in the extra-dimensional geometry. (A 
simple example of monodromy is compactification on two twisted tori.) Seeking to minimize its 
world volume (and hence its energy), the D4 will unwrap, traversing the monodromic dimension 
multiple times. If a D6-brane, on which the Standard Model of particle physics is located, sits at 
a fixed position in the monodromy direction, it will be hit multiple times by the unwrapping D4. 
However, as shown in the article of Chapter 15, the energy transfer from the infiaton towards the 
Standard Model is negligible during these collisions, with reheating taking place instantaneously 
at the last brane encounter. 

Finally, in Part IV we conclude with general remarks on the lessons learnt from the present 
work. We recall their broader scientific context and comment on the prospects in store over the 
upcoming years. Keeping unbridled optimism at arm's length, we are nonetheless confident that 
such prospects are bright. 
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Remarks on Notation 

Indices 

Small greek indices n, v . . . run from to 3, where = t is cosmic time. Small Latin indices 
m, n. . . run from to 9, i.e. over all dimensions, compactified or not. An exception is i, 
which from time to time will be taken to run from 1 to 3 only, such that {x^, x'^), i = 1,2,3 
is ordinary four-dimensional spacetime, and the pair a , 6, which we will sometimes use for 
the {p + l)-spacetime directions aligned with the world volume of a Dp-brane. Capital Latin 
indices A, B . . . run from 4 to 9 (or from 5 to 9 occasionally), that is, over extra (compactified) 
dimensions only. 

Units 

Our units are such that ^ = c = 1, i.e. in particular space and time are measured in the same 
units, which are inverse to the unit of mass. 
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Introducing Modern Cosmology 
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1. Standard Cosmology and Inflation 



In this Chapter, the cornerstones of modern cosmology are presented in broad 
terms. Based on four key observational facts, the Standard Big Bang Model (SBBM) 
offers a natural explanation for the sequence of events in the Universe ever since the 
hot Big Bang 13.7 billion years ago. Questions left unanswered by the SBBM can, to 
a large extent, be resolved by the scenario of inflation, whose mechanism we discuss 
at the background level. 



1.1. The Standard Big Bang Model 

Our understanding of the present-day Universe and its history is summarized in the elaborate 
framework of the Standard Big Bang Model (SBBM) of cosmology. The interplay between 
Nature's interactions, i.e. gravity, electromagnetism, the weak and the strong forces, allows an 
explanation of the evolution of the Universe considerably far back into the past. Using our 
knowledge of electroweak unification around an energy density of pV4 o(100 MeV), one can 
extend this description and obtain a consistent picture from the hot "primordial soup" state 
shortly after the Big Bang until today's large scale structures of filaments, clusters of galaxies 
and their substructures. We now briefly recall some elements of the SBBM picture crucial for the 
purposes of this thesis, along with their experimental evidence. A wide-angle sketch of events 
can be found in Fig. 1.1, the details being the subject of a broad range of textbooks [1, 2, 3, 4, 5]. 

On cosmological scales, gravity is the largely dominant interaction. It is described by the theory 
of General Relativity (GR), whose fundamental quantity is the metric tensor g^i^ of spacetime, 
from which the line elemend ds^ is calculated as 

ds'^ = g/.^dx'^dx'' . (1.1) 

The Einstein Hilbert action, describing the dynamics of this metric, reads 

>5grav = -^j ^''Xy/^R, (1.2) 

with the coupling constant k = SvrG = Svr/mpj = 1/Mpj, g the determinant of gf^u, and R 
the Ricci curvature scalar R = g^^R^u, constructed from the Ricci tensor R^p. Assuming that 
matter in the Universe is described by an action iSmatter, leading to an energy- momentum tensor 
T^i, (X SS^aiter/Sg^'^ , One obtains for the system 5grav+<5matter the Einstein equations of motion^ 

R/j-i^ - ^ 9ixiyR = nTfj,y. (1.3) 

To put these equations to use, wc have to i) choose an ansatz for the metric g^v in Eq. (1.1) 
and ii) specify the description of matter in the Universe, i.e. the form of T^i/. 

^We have not explicitly included a cosmological constant A in Eq. (1.3). Such a term can always be considered 
as the potential energy of a scalar field (see below) at its equilibrium position 0, A = 87rGF(^), and hence as 
a part of the energy momentum tensor T^^i/ . 
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1.1.1. A Homogeneous and Isotropic Universe 



When formalizing g^^, for the Universe as a whole, the observation of homogeneity and isotropy 
on large scales 0(~ 100 Mpc) determines the metric up to an arbitrary function of time a(t) 
(known as the scale factor), and a discrete parameter /C = — 1,0,+1 encoding the spatial cur- 
vature (open, flat or closed). The resulting ansatz is called the Friedmann Lemaitre Robertson 
Walker (FLRW) metric^ 



ds^ = -df + a^{t) 



- — -r^ + r'^ide'^ + Wedci?) , (1.4) 

1 — /Cr^ ^ ' 



which, when plugged into the Einstein equations (1.3), leads to time derivatives of a{t) on the 
left hand side. When it comes to the description of matter, the homogeneity and isotropy 
requirements are met by an ideal fluid, for which 

so that T^^^ = diag(— /?,p,p,p). The pressure p and energy density p are related by an equation 
of state, p = wp, where w can often be chosen constant. Thus, remarkably, the homogeneous 
and isotropic Einstein equations have been reduced to 

= ^P-^, (1.6) 
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" = -Up + ^p), (1.7) 
a 

known as the Friedmann and the Raychaudhuri equation, respectively. The quantity H{t) = d/a 
is called the Hubble parameter and measures the change in the scale factor of the Universe, where 
H > corresponds to expansion. In the following, we assume universal expansion as the default 
case; contracting universes with H < are briefly mentioned in Section 4.3. 
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In Eq. (1.4), the four-dimensional spacetime coordinates read {t,r,9,(p 
trade cosmic time t for the conformal time coordinate rj defined by dt 
scale factor can be factored out and the FLRW metric reads 



ds^ = a^{r]) 



-drf + 



dr^ 



1-/Cr2 



+ {de"^ + sin^ e df) 



It is often useful to 
a{vi) dr], so that the 

(1.8) 



One can then define the quantity ^(r?) = a' /a = a{t) H{t), and Eqs. (1.6) and (1.7) then read 



^2 ^ '^a'p-lC, 



--n' = --a'{p + Sp). 
a D 



(1.9) 
(1.10) 



Evidently, as seen from Eq. (1.6), H has dimensions of mpi and therefore provides a default mass 
scale, while its inverse is known as one Hubble time (or Hubble length, with our choice of 
units), i.e. the standard time scale for processes occurring in the Universe. Finally, length scales 
can be compared with H~^, but also with the "comoving Hubble length" l/(aH) = l/'H. We 
now drop the curvature term in Eq. (1.6), setting /C = for a spatially flat universe, which is 
well-motivated by observations [6] . 



1.1.2. Present Composition of the Universe 

We shall assume that globally, the FLRW metric (1.4) [or (1.8)] describes the Universe through- 
out its history, i. e. also today. We now focus on the matter side of the Einstein equations and 
discuss the present composition of the Universe. To this end, we define the so-called "critical 
density" p^rit in a flat universe with respect to the Hubble parameter at the present epoch^ Hq, 

p„it = -Hi. (1.11) 



K 



The total energy density ptot is a sum of different contributions, ptot = '^iPi^\ where each part 
pf'^ stands for an ideal fluid i with its own equation of state parameter Wi. The dimensionless 
quantities = pf^ / p„it then allow to re-write the Friedmann equation (1.6) today simply 
as Y^i^f"^ = 1. [If one wants to keep the term containing /C in Eq. (1.6), one may define 
= — /C/(aQiJo) for the contribution of the curvature, and hence J2i + = 1-] the 
present Universe, the four components contributing to this sum, their equations of state and 
their relative importance are [7] : 

Radiation 

All the photons in the Universe [most of which belong to the Cosmic Microwave Background 
(CMB), see below], with their equation of state prad = (1/3) Prad represent a tiny fraction of ptot) 
with ^l^J^ ~ 10~^. This equation of state also holds for any gas of relativistic particles such as 
weakly interacting cosmic neutrinos, which are therefore summarily counted with the photons 
into3 O!^. 

Baryonic Matter 

The constituents of ordinary matter (atoms, nuclei etc. ) behave non-relativistically (ie. without 

^All quantities referring to their current (present-day) value are designated by a subscript "0" [or occasionally 
a superscript "(0)"]. 

^Strictly speaking, massive neutrinos fell out of equilibrium recently when the temperature dropped below 
T < 0.01 eV, and only photons remain relativistic today [8]. 
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pressure) at present, hence their equation of state is = and the baryonic contribution to 
^tot (given that baryons are much heavier than leptons) amounts to Q.^^^ « 0.04. 

Nonbaryonic (or "Dark") Matter 

For a consistent explanation of many observational facts, ranging from galaxy formation to the 
CMB, it is necessary to postulate the existence of another non-relativistic matter component in 
the Universe, commonly referred to as "Dark Matter" , equally with an equation of state pdm = 
and contributing a percentage of fi^*^^ ~ 0.26. The nature of Dark Matter is the subject of active 
study and related to theories beyond the Standard Model of particle physics, see Chapter 3. 

Dark Energy 

Evidently, after summing over radiation, baryonic and non-baryonic matter, the bulk part of the 
Universe's energy density is still missing, which (together with evidence from supernovae and 
other observations) motivates the introduction of "Dark Energy" . Its equation of state should be 
very close to pde ~ —Pdei which makes a cosmological constant A (for which wa = —1 exactly) 
the frontrunner candidate for Dark Energy. Here, we shall treat both terms as synonyms. This 
component accounts for the major contribution to ptot) i-^- ^de ~ 0.7 . 

1.1.3. The History of the Universe 

Combination of Eqs. (1.6) and (1.7), as well as conservation of the energy-momentum tensor, 
V^T'^'^ = 0, yields the continuity equation 



which holds for ptot; but also for each individual component pi if the interaction between the 
different fluids i is purely gravitational. Using H = a/a and pi = Wi pi, it is easy to solve for pi 
as a function of the scale factor a: 



The index "in" refers to the energy density and the scale factor at some initial time; note in 
particular that in an expanding universe like ours one has a > am- Defining an initial fraction 
fi = pf^^/Ptot [9]) ^-iid limiting ourselves to the four components enumerated above, i.e. radiation 
(^rad = 1/3), baryonic and Dark Matter {w]^ = Wdm = 0) and Dark Energy {wde = —1), we see 
that their contributions to the total energy density vary as 



Since the scale factor grows with time, and the terms on the right hand side in Eq. (1.14) scale 
with different negative powers of a, we see that each of them dominates ptot during a separate 
era of the Universe's history. 

Present/Future: Dark Energy domination 

In the far future, only the term p^™^ = const, will be left on the right hand side of Eq. (1.14), and 

hence solely feeding into the Hubble parameter H through Eq. (1.6). One can then integrate 
the Priedmann equation to obtain the evolution of the scale factor with time, which gives 



p = -3H{p + p), 



(1.12) 




(1.13) 




(1.14) 



a{t) = Ode e 



(1.15) 
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where a^e = a(^de) is a normalization constant calculated at a i = t^e, and i?de is the (constant) 
Hubble parameter during this (final) phase of evolution. A period of exponential expansion with 
constant Hubble parameter is known as a de Sitter phase. According to the best observational 
evidence today, Dark Energy "recently" became the biggest contributor to our Universe's energy 
density, i.e. around redshift z^e ~ 1- 

Recent past/Present: matter dominated era 

In the recent past, i.e. until about 0(10*^) years ago, our Universe was dominated by matter*^. 
Hence, keeping only the second term in Eq. (1-14) and integrating Eq. (1.6), we find that during 
this era the scale factor changes as 

t 



a{t) = j , (1.16) 

where again Om = a(im)- This era was proceeded by a period of radiation domination; the point 
in time when matter and radiation contributed in equal parts to ptot; it is also referred to as the 
time of radiation matter equality teq, which measured in redshift occurs at Zeq ~ 3600. 

Past: Radiation dominated era 

Finally, going back far enough into the Universe's past, it is the last term and hence radiation 
that should have dominated the energy density of the Universe ptot- Again, an integration of 
the Priedmann equation yields a power law for the growth of the scale factor: 

t 



ait) = arad — (1.17) 

\trad/ 

The fact that the radiation contribution to the total energy density in Eq. (1.14) dies away more 
quickly than that of non-relativistic matter is easily understood from the additional decrease 
of the individual photon energies caused by the universal expansion. Photons only are subject 
to this effect on top of the number density dilution that both photons and massive particles 
experience. 

Evidently, between these three limiting cases, one may still use Eq. (1.6) in combination with 

Eq. (1.14) to find t{a) [and possibly a{t) by inversion] from t — ti^ = (da/a) [(3//«)p^t] 
In general, a universe filled predominantly with an ideal fluid whose equation of state is p = w p 
(with constant w) will expand as 

a{t) = ao ^2/(3+3-)^ a{rj) = ao ,?2/(i+3-) (i.is) 
in cosmic time t or conformal time rj, respectively. During a de Sitter phase^ it follows that 

a{v) = -^, (1.19) 

while a (X r/ for a radiation dominated universe, and a oarf' for pressureless matter. 

For a comoving observer, there is a fundamental difference between the eras of radiation and 
matter domination and a Dark Energy universe: remember that the quantity l/{aH) provides 
a measure for the observer's accessible part of the Universe. As is easily seen from Eq. (1.18), 
while a material with w > —1/3 provides the bulk energy density of the Universe, it holds 



■^As we discussed in Section 1.1.2, the total matter contribution of fib + f^dm ~ 0.3 today is more or less of the 

same order as the inferred Dark Energy « 0.7, giving rise to the so-called "coincidence problem". 
*Note that in a de Sitter spacetime, the conformal time rf is negative and runs from 77 =] — oo, 0]. 
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Figure 1.2.: Left: The Hubble diagram for type la supernovae. The scatter about the line corresponds to 
statistical distance errors of < 10 % per object. The small red region in the lower left marks 
the span of Hubble's original Hubble diagram from 1929. (Figure from [13]) Right: Hubble's 
diagram, leading to the hypothesis of an expanding universe with a linear expansion law 
V = Hr. (Figure from [10]) 



that d/dt {aH)^ > 0, and hence the comoving Hubble radius grows. For a Dark Energy (or 
cosmological constant) dominated Universe, on the other hand, d/dt {aH)~^ < 0, and so the 
comoving Hubble radius shrinks, gradually hiding ever-growing parts of the Universe from the 
observer's view. This property of exponentially expanding spacetimes becomes crucial when we 
discuss inflation in the following Section. 



1.1.4. The SBBM's Observational Pillars 



We established that, according to the SBBM, the Universe evolved from an initial state of very 
high (radiation dominated) energy density into today's composition commonly called "ACDM" 
for a cold Dark Matter Universe with cosmological constant A. The sketch of events provided 
in Fig. 1 . 1 contains some milestones of this evolution - but how can we know this much about 
events so far in the past? The following observational facts, successfully explained by the SBBM, 
are usually quoted as its "pillars" . 

Expansion of the Universe 

First observed by Hubble in 1929 [10], many experiments have since confirmed that galaxies 
(with the exception of the closest ones) are receding from us, with a velocity proportional to 
their distance, leading to redshift in their observed spectra (see Fig. 1.2). The present expansion 
rate is written as 

Ho = WOh^^, (1.20) 
s Mpc 

where the current best measurement of h is 0.719^g Q27 [''']■ Over the last decade, evidence has 
accumulated [11, 12] that galaxies at very high redshift are moving away from us with even 
higher speed than predicted by the Hubble law v = Hq r. This provides independent evidence 
for the existence of Dark Energy, the component held responsible for the missing amount of 
energy density to render the Universe flat. 

Abundance of light elements 
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Figure 1.3.: The abundance 
of light elements 
as a function of 
the number ratio 
of baryons to 
photons rib /n^ . 
The bands show 
the regions in 
agreement with 
the observed 
quantities of 
the respective 
nuclei. Com- 
patibility with 
CMB anisotropics 
is a stronger 
constraint than 
nucleosynthesis 
alone [8]. (Figure 
from [15]) 
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Observations of stars, galaxies and the interstellar medium allow an estimate of the relative abun- 
dance of light elements [i.e. of Hydrogen (which is largely dominant at 75%), Helium (providing 
most of the remaining 25%), Lithium, and Beryllium (of which, like for the heavier elements, 
there are only trace amounts)]. From our understanding of nuclear physics we can infer the 
conditions under which these elements were first produced during nucleosynthesis around red- 
shift z 10^''. Reproducing the observed abundances places tight constraints on this primordial 
environment [14, 8] (see Fig. 1.3). 

CMB AS THE ECHO OF THE BiG BANG 

The cosmic microwave background radiation, first detected by Penzias and Wilson in 1965 [16], 
has become the prime object of study for cosmologists, with the most recent measurements 
performed by the COBE [17] and WMAP [6] satellites, and more data to come from the Planck 
mission [18]. In the SBBM, the CMB photons can naturally be explained as the "echo" of the 
Big Bang released when the Universe was about tdec ~ 300,000 years old (i.e. at Zdec ~ 1100, 
after the onset of the matter dominated epoch) and first became transparent to photons: for 



8 



1.1. The Standard Big Bang Model 



t < ^dec) photons continuously scattered with free electrons, their mean path length being very 
short. Once the electrons were caught by the first nuclei to form neutral atoms, the photons 
decoupled and began to free-stream. (This is also called "recombination", though nuclei and 
electrons had never been combined before.) More precisely, recombination occurs as a two- 
step process with Helium becoming neutral earlier. However, since Helium only accounts for a 
quarter of the Universe's nuclei, it is only somewhat later at Hydrogen recombination that most 
of the photons are released [1, 8]. They reach us today with the same shape of their temperature 
distribution as at tdeo i-e.an almost perfect blackbody spectrum, with its central temperature 
redshifted by the amount of expansion 0(lO'^) that has occurred since. To high precision, their 
average temperature is Tcmb = 2.725 K, which tells us that at recombination, the Universe was 
homogeneous and isotropic on all scales up to the present horizon to at least one part in 10, 000. 
The utmost importance of the microwave background radiation in modern cosmology warrants 
two more comments: first, the high homogeneity of the CMB is a good justification of the 
FLRW metric ansatz for the metric made in Eq. (1.4). Second, the experiments cited above are 
precisely dedicated to measuring the deviation of the CMB from homogeneity, i.e. to determining 
its tiny inhomogeneities of 0(lO~^). Their spectrum is one of the key predictions of the theory 
of inflation we discuss below. 

Formation of structure via gravitational instability 

Under the influence of gravitational instability, initial matter inhomogeneities in an expanding 
universe grow larger with time, at a rate determined by the composition of this universe. Initially, 
one may describe such perturbations using a linear treatment, while the formation of the first 
stars and galaxies occurs in the nonlinear regime [19]. Very sophisticated simulations (see 
Fig. 1.4) are available for the process of structure formation, which notably allow the role of 
Dark Matter to be understood [20]: without it, matter radiation equality tgq (which crucially 
influences the speed of perturbation growth) would occur too late. Also, Dark Matter ensured 
that the period of matter domination lasted sufficiently long for successful structure formation 
before Dark Energy becomes too dominant. 

The last two points are closely related in the SBBM: CMB temperature fiuctuations serve as 
an input for the theory of structure formation, leading to predictions for the galaxy distribu- 
tion, which are well confirmed by observations. The initial fluctuation spectrum for successful 
structure formation must be nearly scale invariant, and recent measurements (see Section 2.2.5) 
confirm that the CMB spectrum indeed has this property. However, both the high homogeneity 
of the CMB radiation and the spectrum of its tiny fiuctuations have no natural explanation in 
the SBBM framework, and it is a phenomenological requirement that the initial perturbation 




Figure 1.4.: Movie stills from a simulation of structure formation in the ACDM model. The 
side of the cube corresponds to 140 miUion light years, and the first picture is taken 
at redshift z — 30, down to today on the right. (Figure by A. Kravtsov, from 
http : // cosmicweb . uchicago . edu/f i laments . html) 
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spectrum should be nearly flat. We now discuss how this situation may be remedied by an early 
period of "inflation" . 



1.2. The Scenario of Inflation 

In the previous Section, the successes of the SBBM were sketched in very broad brushstrokes. 
There are, however, some striking observational facts which remain unexplained within this 
framework, of which we now cite three prominent examples. 

Horizon problem 

We saw that the SBBM traces the origin of the CMB back to the time of decoupling, but 
how can one understand the (almost perfect) homogeneity in temperature observed for CMB 
photons from opposite directions? Turning the clock of expansion backwards within the SBBM 
framework, one finds that only photons from very close regions (within ~ 1 dcg of angular 
separation) in the sky had had time to establish thermal equilibrium by the time of decoupling. 

Flatness problem 

Why is the Universe on its spatial sections so close to flatness? SBBM evolution pushes the 
Universe away from flatness with time, and to be this close to flatness today, the Universe would 
have had to be flat with incredibly high precision initially. 

The problem of unwanted relics 

The Grand Unifled Theories (GUTs) valid at high energy densities shortly after the Big Bang 
predict magnetic monopoles (and other relics) produced during phase transitions, but these are 
not observed today. Where have they gone? 

Detailed descriptions of how each of these problems (and several others) arise within the SBBM 
are readily found in the literature, see e.g. [1, 21, 22]. Here, we content ourselves with stating 
that a remarkably effective and elegant solution to many of these problems at once has been 
found^: the scenario of inflation [23]. In this Section, we discuss the mechanism of inflation at 
the background level, along with its consequences for the evolution of a FLRW universe. 

Technically, the term "inflation" describes a period of accelerated spacetime expansion (i.e. the 
second time derivative of the scale factor a > 0) in the very early Universe, which should have 
occurred between 10~^^ and 10~^^ s after the Big Bang. Prom Eq. (1.7) it is straightforward to 
see that for a > in a FLRW universe, we need an ideal fluid with 



which violates the strong energy dominance condition p + 3p > and is tantamount to negative 
pressure. Recalling that p and p are related by the equation of state, p = w p, we see that 
Eq. (1.21) means w < —1/3. While strange in "every day" matter, the condition (1.21) can be 
met if the ideal fluid flUing the Universe is a scalar field cf), whose action may be written as 



The function V{(f)) is called the scalar field's potential and contains its mass term as well as 

®An early inflationary era is now often considered an integral part of standard cosmology. Here, we still distin- 
guish between the "traditional" SBBM and its "add-on" element inflation. 



p + 3p < 0, 



(1.21) 




(1.22) 
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interactions. The energy-momentum tensor for (f) [compare Eq. (1-5)], 

^1 



(1.23) 



allows us to identify energy density and pressure [note that in a FLRW universe, the field is 
spatially homogeneous, i.e.<j) = ^{t) only] as 

P=l<P'' + V{<f>), P=\^''-V{<t>). (1.24) 

Hence, for a scalar field p = —p + <j)^, and while (jP ^ V{<^), we can obtain^ w < —1/3 in 
agreement with Eq. (1.21). However, w for a scalar field changes with time since (f) evolves 
according to its equation of motion obtained from varying Eq. (1.22), 

■ dV 

(f) + 3H(j)+ — = 0, (1.25) 
d(p 

where the Hubble parameter H in the friction term is obtained from Eq. (1.6) with p from 
Eq. (1.24). Note that while (p^ < F(0), is essentially given by the potential^ ^{4')- 
For later convenience, we also write down the equation of motion for a time and space dependent 
field (f) = (t>{x^), obtained from variation of the action (1.22): 

^ d,{^99^''d.<t>) + '^=0 (1.26) 



y-9 

For a thorough explanation of how an inflationary phase solves the SBBM problems, we again 
refer the reader to the literature. With respect to the problem of unwanted relics, it is clear that 
inflation simply dilutes their densities to less than one relic per Hubble volume. The flatness 
problem is solved because the short period of inflation drives the Universe so close to spatial 
flatness that the entire ensuing SBBM evolution cannot push it significantly away again. In- 
flation also predicts such drastic, quasi-exponential expansion of spacetime that CMB photons 
from the entire sky were in causal contact (i.e. thermal equilibrium) before. 
The Universe has not been inflating throughout its history (though, as discussed above, its ex- 
pansion recently started to speed up again under the influence of Dark Energy), nor has it always 
been dominated by scalar field matter. Therefore, the realization that a scalar "inflaton" field 
can drive a period of early accelerated expansion (and hence solve SBBM problems) immediately 
raises two questions: How long did this inflationary era last, and how did it end? How can one 
ensure a smooth transition towards the (initially decelerated) SBBM evolution, more precisely 
to the radiation dominated epoch? 



1.2.1. Quantifying Inflation 

In Section 1.1.1, we already used two different time coordinates, cosmic time t and conformal 
time r}. For measuring the amount of expansion during inflation, it is useful to define yet 
another measure of time called the number of "e-folds" related to cosmic time by dA'^ = 
Hdt. We choose to define N counting from the beginning of inflation onwards, i.e.N{tin) = 
0. [Alternatively, the number of e-folds can be specified backwards from the end of inflation, 

'^For a positive potential, w is bounded from below by the cosmological constant value w = —1, and consequently 

the weak energy condition p + p > still holds [1] . 
*We quantify this statement in the next Chapter. 
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i.e. N{tend) = 0, so that N{t) measures the amount of inflation still to occur.] Using H = a/a, 
we see that 



Commonly, between 40 and 60 e-folds of expansion are considered necessary to solve the SBBM 
problems cited above [1, 2]. Note, however, that this number depends on the energy scale of 
inflation, and that the cited range iV = 40 — 60 is for a potential with V^/^ ~ 10^*^ — 10^^ GeV. 
The number N can be smaller if the inflationary energy is lower, the only firm lower bound being 
that N should cover the entire range of observed scales (from today's Hubble radius to e.g. the 
scale enclosing lO'^m©), which imposes > 14 [24]. (Put a different way, the "observational 
window" of scales is AA^ = 14 e-folds wide.) 

The only a priori upper bound on the total amount of inflation N^ox comes from the restriction 
that the initial field value (/^in must not lie in the regime of super-Planckian energy densities, 
i.e. V^^^{4'hi) < 1. With this constraint, however, the expansion of the Universe during inflation 

can still be much larger than the minimum requirement, of 0^10^"^°^ [25]. Additional restric- 
tions may, on the other hand, be well-motivated for an inflaton field originating from a Grand 
Unified Theory, as we shall see in later Chapters. 

Assume the condition (p^ ^ (and hence a > 0) is satisfied intially at some field value 

- how can we make sure inflation lasts? One may think of the inflaton (/> as a ball up on an 
inclined surface, starting out with negligible initial velocity compared to its potential energy at 
some position (see Fig. 1.5). If the surface is sufficiently flat, the ball does not pick up speed 
too quickly because of the friction term in Eq. (1.25), and (j)'^ <^ ^(<A) continues to hold for some 
time. Indeed, returning to the field 0, we see from Eq. (1.25) that, if the "driving force" dV/d(f) 
is small, the change in velocity d(f)/dt = (j) will be negligible, and the Klein-Gordon equation 
may be approximated as^ ?>H(f) ~ —dV/dcf). 

Recall also from Eq. (1.6) that <C V^(^) translates into H"^ ~ (k/3)F(^). Hence, if the 
potential V{(f)) is almost flat, the Hubble scale changes little with time and may be considered 
constant during one Hubble time of expansion. Why not set H = const., corresponding 

to a perfectly flat potential? Indeed, we already encountered a constant Hubble parameter in 
Section 1.1.3 when we studied the case of Dark Energy domination in the far future, giving rise 
to a de Sitter universe. Note, however, that at present we try to describe a phase of inflationary 
expansion that we know must end after a certain amount AT^ot of expansion. Considered over the 
entire duration of inflation, the Hubble parameter H does change, but during each individual 
Hubble time interval H^^, its variation is small. We cannot allow H to stay exactly constant 
as inflation would then never come to a halt. In this sense, inflation may be called a period of 
quasi- de Sitter expansion. 

1.2.2. Ending Inflation 

After the flnal e-foldof inflation, a smooth transition to a radiation dominated universe must 
take place, in which the SBBM can start out. (For this transition, the term "graceful exit" has 
been coined, and it was indeed a major problem in the first models of inflation.) Hence we need 
a mechanism for transferring the energy density stored in the infiaton field to the SBBM degrees 
of freedom such as photons and matter fields. This phase is called "reheating" , referring to the 
fact that by the end of inflation the blown-up Universe is empty and cold, and one must convert 
the inflaton into relativistic matter to start the Big Bang evolution [26] . 

®Again, we formalize this quantitatively below. 




(1.27) 
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Figure 1.5.: Left: Sketch of a flat stretch of a smgle field inflaton potential V{<j>). Inflation occurs while 
the field rolls from an initial to a final field value, expanding the Universe by a given 
number of e- folds . The amount of e- folds required depends on the scale of inflation, see the 
discussion in the text. The minimum requirement to produce all observed scales today is 
A7V = 14 [24] . Note that only the part of the potential corresponding to these scales can be 
constrained from observations of the CMB power spectrum (see following Chapter). Right: 
At the end of inflation, the inflaton field 4> drops down into a minimum and starts oscillating. 
It decays and transfers its energy to the particles of the Standard Model, triggering a phase 
of reheating from which the Big Bang model can start out. 



In the picture of the inflaton rolling down a potential, the phase of reheating sets in when cj) 
reaches some field value (pcnd where the field velocity becomes too fast and the potential energy 
y(^end) no longer dominates, (p then starts oscillating around a minimum of its potential (see 
Fig. 1.5). In this phase, couplings of the infiaton to other matter fields become important and 
are commonly summarized in an additional friction term oc (j) in the Klein Gordon equation 
(1.25), parametrized with a coefficient T^g. (Coupling to bosonic degrees of freedom is more 
effective than to fermionic fields, therefore only the former are typically considered.) Though 
difficult to treat analytically, the phase of reheating is nevertheless crucial for predictions of 
infiationary observables: it is during reheating that today's observed scales are tied to those 
during inflation (see the next Chapter), therefore the number of additional e-folds produced in 
the reheating phase (which can be important in some models) has to be taken into account. 
If the oscillating inflaton were to decay only in the usual, perturbative way (ie. each of the 
particles of the homogeneous background fleld decays independently) , a large number of oscil- 
lations would be required. However, parametric resonance decay can also occur if coherence^'^ 
of the inflaton is taken into account (leading to the terminology of "preheating" , for parametric 
reheating) [27, 26, 28]. This nonperturbative, resonant conversion of the inflaton energy into 
other scalar particles is much more efficient than the regular couplings (typically a few oscil- 
lations suffice). The effective mass and momenta of the particles produced through resonance 
can exceed the infiaton mass, which brings GUT energy scales back into play, even if infiation 
terminates at lower energy density. In this case, however, one must be careful not to re-produce 
unwanted heavy relics (whose density, given that inflation is essentially over, cannot be diluted 
away anymore). In a region of the inflaton potential where V" < 0, also "tachyonic preheating" 

^°In this sense, the Universe filled with a coherently oscillating inflaton behaves like a Bose Einstein condensate 
and hence is anything but hot [21]. 
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can occur, which is even more efficient than preheating and may require only a single inflaton 
oscillation [29]. 

1.2.3. Building Inflationary Models 

In conclusion, when building a model of inflation, one faces two choices, i ) the form of the poten- 
tial V{4>) and a) the way of ending inflation. Hence, these serve as criteria for the classification 

of inflationary models, which may be subdivided into the categories of [25, 30] large field, small 
field and hybrid inflation models. An extensive review of inflaton potential shapes (as well as 
their motivation from high energy physics, which we treat in Chapter 3) can be found in [31]. 

Let us write a simple general inflaton potential as V{4>) = A^/ [21], where the parameters 

A (the "height" of the potential) and ^ (the "width" of the potential) have dimensions of mass. 
By / we denote some general function of different shape for each class of models [21]. For large 
field models, typically one has 

F(,^) = Aff f A y , n>2. (1.28) 

The potential can also be a sum of such terms, i.e. a polynomial, and the n must be such that 
the potential is positive-definite. (Fractional values of n are also possible, and for example 
potential in the model treated in Chapter 15 is of this form.) These potentials are also referred 
to as "chaotic inflation" [32, 33]. This nomenclature [34] is due to the arbitrary choice of 
the initial field value <^\a in different parts of the Universe, i.e. the background field may be 
distributed chaotically. (We discuss this in more detail in the next Chapter.) The case n = 2, 
ie. a simple mass term F(0) = (m^/2)(/)^, is among the oldest models of inflation proposed 
[32] (and fits the data impressively well, see Section 2.2.5), as is the quartic potential with 
n = 4 and T^((/>) = (A/4)(/)^ (which seems excluded by recent observations). Note that for 
monomial potentials like Eq. (1.28), there is but one free parameter. The exponential potential 
V{^^) oc exp [—(/>/ (j5^/^ mpi)] , leading to "power law inflation" [35] with the scale factor growing 
as a power law of time, a oc f^, also belongs in the large field category. The initial field value 
(/)in measured in Planck units is large in this class of models, typically (/)in/mpi ~ 0(1 — 10), but 
avoiding the region where V ~ ni%y. During infiation, the field moves towards smaller (\) until 
it encounters (/ignd < '/'in, where inflation (but not the inflaton) comes to a halt (see Fig. 1.6). 
Then the oscillatory phase of reheating with particle production takes over. 

In the class of small field models, the values of during inflation are small compared to a given 
scale // (often the Planck mass) , and the inflaton's direction of motion is typically inversed (the 
held rolls from small towards larger values). One may write the shape of the potential as (see 
Fig. 1.6) 

n>0, (1.29) 

as it can, for example, arise from spontaneous symmetry breaking. Again it is assumed that 
after reaching (^end > ^in, the inflaton drops into a minimum where it oscillates. Note that there 
are two parameters (Agf and /i) in Eq. (1.29), which should eventually be fixed by observations. 

Hybrid inflationary models [36] involve two scalar fields and ^ with a typical potential 



y{<^) = Asf 
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Figure 1.6.: Left: The inflaton (p rolling "from right to left" on a potential of the large field type (1.28). 

The field is at its largest value initially, rolling towards smaller values. Center: An example 
of a small field potential (1.29), where the direction of motion is inversed. Inflation occurs 
for small values of (f). Right: In hybrid models of inflation with a two-field potential (1.30), 
one field (^) is trapped at a minimum during infiation (which is non-zero and provides most 
of the energy density). Along the inflationary direction 0, the potential is flat but has a 
slope towards the waterfall point 0c, where the minimum of ip turns into a maximum: the 
steepest direction in field space is now ip, and the system moves towards its true minimum. 
Note that at the critical point (pc, the energy density is still non-zero. 



where the potential for (p is e.g. of the form V{(p) = A'* [1 -|- ((/>///)"]. During inflation, only 
the field (p is dynamic, while ip is "trapped" at a minimum ■0 (see Fig. 1.6) and provides a 
non-vanishing potential energy. The effective mass of tp evolves while cp is slow rolling, = 
[4>^ — (/)^), and becomes negative at c/)^ = X-ip"^ /g"^, where ■0 changes from a minimum to a 
maximum. This usually marks the end of inflation, and ip rolls off in a different direction in field 
space towards the global minimum, around which it starts oscillating. Therefore, the tasks of 
driving and ending inflation are split among the fields (p and ip in hybrid inflation, and notably 
the potential energy is not yet minimized when <j) reaches the "waterfall point" (pc. 



1.2.4. Inflation with Modified Kinetics 



In the Lagrangian of Eq. (1.22), the scalar field (p has a canonic kinetic term, which notably 
inspired our analogy of a ball rolling down an inclined surface, picking up speed. However, 
a priori we can extend our notion of scalar field dynamics and allow the Lagrangian to be a 
general function of inflaton derivatives, while keeping the GR part of the theory unchanged. It 
can be shown that in this case accelerated expansion may be obtained on potentials too steep 
for conventional slow roll: this mechanism is known as fc-inflation [37, 38, 39], where the prefix 
k stands for "kinetically driven". The matter Lagrangian of such a theory may be written as^^ 

C^=p{X,cP), X = ]^g^,d^<P&'4>. (1.31) 

Note that p(X, (/>) cannot be a function of X only to successfully drive inflation, as it is notably 
the ^ dependence that will ensure a "graceful exit" in this scenario [1, 37]. It is usually assumed 
that p(X, (/)) admits an expansion p(X, (p) = K{(p) X + L{(p) X"^ + .... We therefore frequently 
choose the notation = F (cp, g^^ d'^<p d'^cp) — V{(p), keeping the potential term explicit. 
For the theory to be well-defined, one must have dp/dX > and 2Xd'^p/dX'^ + dp/dX > 

^^We do not change the notation ol Cf,, p, p eic. when referring to a non-canonic theory. In each case, it will be 
clear from the context whether the dynamics of the scalar field under scrutiny are standard or not. 
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[40]. The function p{X,(p) can be interpreted as the effective pressure, with the corresponding 
energy density p(X, 0) given by 

p = 2X^-p. (1.32) 

The continuity equation (1.12) remains formally unchanged with these redefinitions of p and p. 
Recall that inflation can take place while the effective equation of state p = w p obeys w < —1/3, 
which for the Lagrangian of Eq. (1.31) corresponds to X dp/dX <^ —p. While this condition 
holds, the Universe can inflate because the action depends non-linearly on the kinetic energy 
though there is no slow roll for the field 0. Note that the resulting Hubbe parameter is still 
given energy density defined in Eq. (1.32). 



For completeness, let us state that it is not inevitable to invoke a scalar field to achieve infla- 
tionary expansion [1, 30]. The dynamics discussed above may also be obtained from adding 
terms to the Einstein Hilbert action (1.2), i.e. higher powers of the Ricci curvature scalar R and 
of Rfj_uR^^ ■ The resulting equations of motion contain higher order derivatives, which means 
that the gravitational fleld has degrees of freedom on top of gravitational waves, which usually 
include a scalar field. When the additional terms in the Einstein Hilbert action are restricted 
to functions f{R), these are called "scalar tensor theories" [41, 42]. There exists a one-to-one 
correspondence between these terms and certain shapes of the potential V{(j)). 
In this Section, we discussed inflation in the early Universe, its duration and its end. We have, 
however, so far ignored a most fundamental question: what is the nature of the scalar field (f)7 
We postpone this question for one more Chapter and first turn to the theory of inflation at the 
perturbative level. 
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We explain why the inflationary scenario is considered a good explanation for the 
origin of structure in the Universe: when perturbed around its bulk, a scalar field in 
a FLRW universe predicts the spectra of primordial scalar perturbations to be nearly 
scale invariant, i.e. ideally suited as a starting point for SBBM structure formation. 
The most important tool for observational verification is to map out temperature 
fluctuations of the photons released at last scattering, which are obtained from mea- 
surements of the Cosmic Microwave Background (CMB) radiation. We show how 
CMB observations are related to the inflationary calculations, and quote recent ex- 
perimental results along with their consequences for inflationary parameters. The 
Chapter concludes with a brief discussion of the physics at very small (Planckian) 
and very large scales, at which inflation must be conceptually consistent. 



2.1. Perturbations in Inflation 

It is evident from the observation of galaxies, clusters etc. that matter is distributed inhomoge- 
neously at scales below 0(~ 100 Mpc), which makes it necessary to study the Einstein equations 
(1.3) beyond the FLRW approximation. In this Section, we therefore push both the scalar field 
(j) and the metric tensor g^y to linear order in inhomogeneous perturbations. There are good 
physical arguments for this since in a universe governed by quantum mechanics, tiny fluctuations 
5(t) and Sg^i, of quantum origin are unavoidably present. 

The linear perturbativc treatment can be reduced to the study of three dynamical degrees of 
freedom, corresponding to the inflaton itself and the two polarization modes of the graviton. 
We now show how the primordial spectra of these fluctuations may be obtained as a generic 
prediction of inflation, and expressed in terms of background quantities. When used as an input 
for the SBBM plasma and particle physics machinery, the perturbations of the inflaton field 
eventually produce an imprint in the CMB in terms of temperature fluctuations. The tensor 
perturbations of the metric lead to a gravitational wave background (i.e. ripples in spacetime 
itself), which, if they were detected by future experiments, would provide us with the earliest 
snapshot possible of the young Universe. 



2.1.1. Matter and Metric Perturbations at Linear Order 

If the Universe is filled with scalar field matter, perturbing the right hand side of the Einstein 
equations (1.3) amounts to setting^ 



subscript "0", or superscript "(0)", in this Section denotes background quantities, and not present-day values. 




(2.1) 
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We work in conformal time rj in the following. The left hand (gravity) side of Eq. (1.3) may be 
perturbed by writing the line element as 

ds'^ = gj^J dx" dx" + dx'^ dx"" , (2.2) 

where the background part gj^J dx^ dx'^ is given by Eq. (1.8). In a straightforward generalization 
of vector decomposition into a gradient and a curl part, the metric perturbations Sg^i^ in Eq. (2.2) 
may be decomposed into a scalar, vector and tensor part with respect to their transformation 
properties under the group of spatial rotations and translations: 

Sg,.u = Sgj^J+SgP + S9<;^J (2.3) 

The three perturbations decouple at linear order and can therefore be treated independently. 
As stated above, the tensor part Sg'^^ leads to purely gravitational waves^, while Sgj^J couples 
to the matter perturbation Scf). We drop the vector contribution Sgju'^ since it decays with time 
in the absence of vector stress energy perturbations. 



Scalar perturbations 

Starting our perturbative analysis with Sg'^J , we note that in their most general form, scalar 
metric perturbations can be expressed in terms of four scalar functions ip, ijj, B and E [43], 

where 'jij is the metric on the spatial sections, hence in a flat universe jij = 6ij. The full line 
element (2.2) for the background and (only) scalar metric perturbations reads 

ds^ = {(1 + 2(p)dr]'^ + 2B^idx' dr/ - [(1 - 2V')7ij - 2E^ij] dx' dx^} . (2.5) 

However, the functions ip, tp, B and E are related to 84> via the Einstein equations (which are 
by construction invariant under general coordinate transformations), therefore it is crucial to 
separate physical degrees of freedom from gauge modes. The simplest linear combinations of 
metric scalar functions that are gauge-invariant at linear order are 

1 a! 
$ = - - [a(B - E')y and * = V + - (-B - E') , (2.6) 
a a 

the so-called Bardeen potentials [44]. Likewise, it is possible to construct a gauge-invariant 
quantity 5^^^') from the perturbation dcj) in Eq. (2.1) by 

50(gi) + (2.7) 

From a physical point of view, there are only two degrees of freedom in Eq. (2.5), which can 
be represented by the potentials $ and 'f. As a consequence, two superfluous gauge degrees of 
freedom may be removed e.g. by choosing the so-called longitudinal gauge in which B = E = 0. 
This then leads to the identifications ^ = (p and = ^. 

The evolution of $ and ^ is obtained from a perturbative expansion of the Einstein equations 
(1.3) and the general Klein Gordon equation (1.26) using Eqs. (2.1) and (2.5). In particular, 

^which are automatically gauge-invariant 
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for the ideal fluid ch there is no anisotropic stress, and it is found that^ ST'''^f^^ oc (5%- and the 
two Bardeen potentials $ = \1'. The remaining free metric perturbation variable $ then is 
a generalization of the Newtonian gravitational potential. [Therefore, the longitudinal gauge 
where one sets B = E = is also referred to as "conformal Newtonian gauge" since $ = 
where ip appears in the goo component in Eq. (2.4).] The perturbed Einstein equations then 
lead to the following system of equations for $ [43] : 



$' - = 
+ m^' + {%' + 2v?) $ = 

[We use the shorthand notation V^^ = (dF/dc/))^ 
background field in conformal time obeys 



gi) 



(2.8) 

(2.9) 
(2.10) 



.] From Eq. (1.26), we also find that the 



+ 21«(/.'o + = , (2.11) 

while at the perturbed order we obtain for (5^(si) as defined in Eq. (2.1) the equation 

S(t)^^y' + 2n 5^^^'^ - V^<5(/>fe^) + aV^^ 54>^^'^ = 40^ - 2aV^ $ , (2.12) 

where evidently y<^(^ = (d^y/d0^)^_^^. The perturbed Einstein equations together with Eq. (2.12) 
can, with the help of the background relation Eq. (2.11), be combined into a single equation of 
motion. 



^" + 27"^^'-V2$ + 2<^(,f^y$ = 0. 



(2.13) 



Remarkably, we have therefore reduced the study of scalar perturbations around a homogeneous 
and isotropic cosmological background to the evolution of one single (classical) variable which 
is the relativistic generalization of the Newtonian gravitational potential. Indeed, one may even 
use a variable redefintion'^ u = (2/k) (a<&/0Q) to remove the friction term in Eq. (2.13), so that 
u obeys 

,2„ Q" . . n 



u" - - — u = 0, 



(2.14) 



Note that this equation has the form of a parametric harmonic oscillator, i.e. an oscillator with 
time-dependent frequency. 



Quantization of scalar perturbations 

For consistency, a quantum treatment of perturbations has to start from the action of a system, 
in this case from S = 5grav + 5^- Introducing as a new perturbation variable the "comoving 
curvature perturbation" 3?, 

2.15 

^By 5T'^^J'^\ we denote the gauge-invariant perturbed energy momentum tensor that can be constructed using 

Eqs. (2.6) and (2.7), see [43]. 
"^Note that $ is dimensionless, and that the factor 1/k in the definition of w ensures that this property is preserved 

after the variable redefinition. 
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the second order perturbed action 628 = (^2'5grav + <^2'50 may be written as [43] 



SoS 



II' 



). 



z = 



a<t>o 

n 



(2.16) 



[In Eq. (2.15), 3? is written in longitudinal gauge. It can, however, be expressed in a gauge- 
invariant way, which we shall use in later Chapters.] Physically, Jl describes the curvature 
perturbation on hypersurfaces orthogonal to comoving worldlines. Much like the pair of vari- 
ables ($, u) above with their respective equations of motion (2.13) and (2.14), let us define the 
Mukhanov Sasaki variable v from^ 



V = a 



for which the perturbed action 828 reads 

1 



d X [ V' 



.12 



-z , 



— V 
z 



(2.17) 



(2.18) 



In changing from 3? to the overall time-dependent factor of z^ has been removed from 
Eq. (2.16). Formally, it therefore looks like the action of a scalar field in flat Minkowski space- 
time but with a time-dependent "mass" term z" jz. As shown below, this similarity can be 
exploited to find a well- motivated vacuum definition for the perturbations. Following [43], we 
now perform the quantization for Eq. (2.18), translating the result^ back into using Eq. (2.17). 

Defining the canonically conjugate momentum from Eq. (2.18), tt = dC/dv' = w', we promote v 
and TT to operators v and ir with equal-time commutation relations 



[{)(r?,f),{}(r/,x')] 

[?!-(??, f),7r(r?,f')] 
[v(77,x),7r(r/,f )] 



0, 
0, 

iS{x — of) . 

Variation of Eq. (2.18) with respect to v gives for the evolution^ of v that 



(2.19) 
(2.20) 
(2.21) 



-v = 0. 



(2.22) 



In flat FLRW spacetime, we can use plane waves oc exp{ik ■ x) for a Fourier expansion of v, 



1 



(27r)3/2 



vM are'^-^ + v*kir]) ale 



t „—ik-x 



(2.23) 



where the creation and annihilation operators a^, ot obey the standard commutation relations, 



at, at 
fc' k' 



0, 



id^{k-k') 



(2.24) 



As a consequence, the number- valued mode functions Vk{ri) in Eq. (2.23) each obey the Fourier 
transform of Eq. (2.22) , 



v'k+lk 



z 
z 



Vk = 0. 



(2.25) 



®Note that the comoving curvature perturbation 3i is dimensionless, while [v] = mpi, i.e. the Mukhanov Sasaki 

variable has the dimensions of a scalar field. 
®This is possible because Eq. (2.16) and Eq. (2.18) are equivalent up to boundary terms. 



'^We are hence working in the Heisenberg picture. 
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where the constraint equation on the right (to be imposed on the solutions of the equation 
of motion) is called the Wronskian condition and ensures that the commutation relations of 
Eqs. (2.19)-(2.21) still hold. Translating Eq. (2.25) back into the comoving curvature perturba- 
tion's Fourier modes 0?^, one finds the equation of motion 

•311 + 2- + k'^^lk = 0. (2.26) 

Tensor perturbations 

Perturbations of tensor type in the metric are described by 

-^yff = ( l- ) ' = 0' h^, = , (2.27) 

where the two conditions on the right express that hij is traceless and transverse (as well as 
symmetric). The tensor analogue of Eq. (2.12) reads [43] 

h'lj + 2% h'ij - hij = , (2.28) 

Physically, two degrees of freedom arise from these perturbations, corresponding to the two 
polarization states of the graviton. We may therefore write hij = J2<> V't '^ij ' where where 
o = +, X and the ep^ are vectors of longitudinal and transverse polarization, respectively. The 
scalar functions evidently both obey Eq. (2.28), and we drop the index o in the following. 
For quantization, the variable is again effectively redefined as^ = a^p^j \f2K^ which, much 
like the Mukhanov Sasaki variable v in Eq. (2.25), after Fourier decomposition obeys an equation 
of motion [an equivalent Wronskian condition as in Eq. (2.25) also applies to its two solutions]: 

Vfe+(fe'-^)^Tfe = (2.29) 
Note again the parameteric oscillator structure of this equation. 

2.1.2. Solutions in the Sub- and Super-Hubble Limits 

Via Fourier transformation, we have formally introduced the "labels" A;, i.e. comoving wavenum- 
bers for the modes, which correspond to a certain length scale given by the corresponding co- 
moving wavelength A oc \jk. This wavelength A can be compared with the comoving Hubble 
radius {aH)~^ we defined earlier, which divides the modes into those which at a given time rj 
are 

• sub-Hubble: k/aH > 1, 

• or super-Hubble: k/aH < 1. 

Another way to see this is that the product a{r]) A corresponds to the physical length scale 
'^phys(^) that grows with time in an expanding universe. The sub- and super-Hubble separation 
than is according to Aphys < or Aphys > H~^, respectively. However, the Hubble scale 

changes slowly in an inflationary universe. Hence, a given mode can therefore start inside 

*The resulting v.,. has the same dimension (mpi) as a scalar field. 
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Figure 2.1.: Sketch of the Hubble crossing of a scale k during and after inflation. The horizontal axis 
measures time (e-folds), the vertical axis denotes length scales (no units are given). The 
black solid line is the Hubble radius H~^, which during inflation evolves slowly and is 
practically constant during one Hubble time. After inflation, it grows much faster first 
during the radiation and then during the matter dominated epoch. (The slope of H^^ in 
these epochs is not to scale.) The blue dashed line is an exemplary scale k that leaves the 
Hubble radius during inflation and crosses back into it (i.e. becomes observable and starts 
evolving again) shortly before radiation matter equality. Such as scale therefore probes the 
physics of inflation [i.e. the potential 1^(0)] at the time Nk when it was "frozen in". 



the Hubble radius, grow and cross out of it at a certain time ??hc to continue as a super-Hubble 
mode. This is illustrated in Fig. 2.1. The instant j^hc, where "he" stands for "Hubble crossing" is 
determined by A; = a(%c) -f^(%c)- The perturbation evolution equations (2.29) and (2.25) may 
therefore be studied in these two distinct limits, depending whether a given k is in- or outside 
the Hubble radius. 

We now discuss tensor perturbations, whose equation of motion (2.29) shows that they behave 
like a spectator field in an fixed cosmological background. Deep inside the Hubble radius, the 
"mass" term a" /a in Eq. (2.29) is much smaller than k'^, hence 

v^l + k'^v^i^ ~ 0, 

v^^ oc Afc 6-*'=^ + 5fc e^^'' . (2.30) 

The choice of constants Ak,Bi: corresponds to the choice of vacuum state for the quantized 
perturbations (see below), and they have to obey the Wronskian condition. Far outside the 
Hubble radius, we drop the k'^ term in Eq. (2.29) and therefore find 



oc a. (2.31) 

How can we interpret this result? Recall that was a redefinition of ifr^,, where iprj, oc v^/a. 
Therefore for the solution (2.31), each mode (/?^^ simply approaches a (fe-dependent) constant 
once outside the Hubble radius. This observation is at the heart of the notion of "mode freezing" : 
in terms of the damped harmonic oscillator equation Eq. (2.28), the are dynamic while 
they are inside the Hubble radius for each k, but after they have passed the point where k = 
ct-iVhc) H{r]hc), V't/c freezes to a constant value. 

For the scalar mode functions, it is easy to see from Eq. (2.25) that they oscillate analogously to 
the solution (2.30) while inside the Hubble radius. Since Eqs. (2.28) and (2.25) are related by 
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replacing a with the function z, we conclude that outside the Hubble radius, scalar perturbations 
obey Vk oc z, and therefore the modes of the comoving curvature perturbation IRfe = —Vk/z, like 
the (^^j^. in the tensor case, approach a constant for —kr] — > 0. 



2.1.3. Hubble Slow Roll Parameters 



The evolution equations (2.25) and (2.29) have the same structure apart from the replacement 
of the function z with the scale factor a. Let us rewrite this difference in an astute way using 
z = a(t)'Q/'H = a(j)o/H. From a time derivation of Eq. (1.6) it follows, with the use of Eq. (1.25), 
that H = -(k/2) (pl. Hence, we can write 

2 1/2 H 




The dimensionless quantity ei measures the change in the Hubbe parameter^, and it is a straight- 
forward consequence of the definition H = a/a that 



a 



(1 - ei) (2.33) 



and hence we need ei < 1 to ensure o, > and continued cosmological inflation. Inflation ends 
when the field reaches the value ^end such that ei(^end) = 1- 

So far, all expressions were exact. Recall that the limit of constant H (and hence ei = 0) 
corresponds to de Sitter expansion [where a = —l/{Hr]), see Eq. (1.19)]. It can then be shown 
[22] that, neglecting higher derivatives of the Hubble parameter, during the inflationary quasi 
de Sitter phase one may write 

asi--^— *— , or r/Ri — 77 • (^.34) 

Ht] 1 — ei aH 1 — ei 

These expressions are the first step in a pcrturbative expansion of the inflationary background 
around de Sitter space, which may be formalized using the so-called Hubble slow roll parameters. 

The natural unit of time during inflation are e-folds , with dA'^ = Hdt, so that the definition of 
ei in Eq. (2.32) in terms of e-foldsreads ei = —{dH/dN)/H = — dlnH/dN. This inspires the 
definition of a logarithmic derivative hierarchy of parameters defined by 

Ci+i = , « = 0, 1, . . . , eo = — . (2.35) 

i^in is the value of the Hubbe parameter at the onset of inflation. All are of the same order, 
and expressed in terms of (cosmic) time derivatives of H the first two members of the hierarchy^^ 
read 

H 1 H , , 



''Using H^aH, one finds ei = 1 - W fH^ 
^°The third Hubble flow parameter is 



e3 = 7ei-e2-6^--^-^. (2.36) 
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From successive derivations of Eq. (1.6) we find tliat 

^l = -H\r, 4>Q = ^ (e2-2ei) , (2.38) 

K 2 

wliich allows us to rewrite the background Friedmann and Klein Gordon equations (1.6) and 
(1.25) as 

K V ■ 9V' 

^'=3W3' M0-2e, + e,) = --. (2.39) 



Slow roll approximation and trajectory 

Using the e^, we can employ Eqs. (2.39) to quantify the statements made in Section 1.2.1: 

ei < 1 : H^^'^V (2.40) 

ei,e2<l: ^o^-^ (2.41) 

From Eq. (2.38), we see that ei,e2 <C 1 ensures potential energy domination and negligible 
acceleration of the field. The situation where <C 1 for all i is called the "slow roll approxima- 
tion" . It usually suffices to consider the first few members of the hierarchy, i.e. ei, e2, £3 (which, 
as wc sec below, are the ones appearing in the dominant terms of the perturbation spectrum, 
its spectral index and the running). Note that, while ei = 1 marks the end of inflation (and we 
therefore can write 0end = (pei): the slow roll approximation breaks down as soon as one of the 
Si = 1. Let us assume this happens when the field reaches determined by e2(0e2) = 1- usu- 
ally, the end of the slow roll regime and of inflation occur around the same time and (p^-^ ~ (p^^ , 
but for precise calculations one must take into account whether for a particular model the slow 
roll approximation holds until the end of inflation or breaks down earlier. 

While ei, 62 <^ 1, we can use the definition of N together with Eqs. (2.40) and (2.41) to calculate 




(2.42) 



which gives a function N((f)) measuring how many e-folds have already occurred when the field 

reaches the value (j). [In Eq. (2.42), we have written V' = dV/d(f), and we shall continue to use 
this notation for potential derivatives.] Consequently, the total number of e-folds A'tot is given 
by integrating from to ^end in Eq. (2.42). For certain choices of the potential V((p), one can 
invert N{(l)) to obtain the slow roll trajectory (?!>(-/V), which describes the time evolution of cf) 
while €1,62 <^ 1. 

In passing, let us note that Eq. (2.42) suggests another interpretation of infiaton field pertur- 
bations 6(j): they correspond to small fluctuations in the total number of e-folds 5N for a given 
point in the Universe because inflation will end slightly earlier (or later). It is possible to for- 
mulate the perturbation theory on large scales (including the calculation of spectra) in terms of 
these fluctuations in the duration of inflation. This is the "5iV formalism" [45, 46], which is well 
suited to calculate perturbations in multifield scenarios (see Chapter 4). 

Example: slow roll trajectory in large field inflation 

Inserting the large field potential of Eq. (1.28) into Eq. (2.42), one obtains that 
during the slow roll phase 

iVW = -(4-4). HN)=(A-'^n)"\ (2.43) 
n \mpi rripyj \ Att J 
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This is one example of a potential for which the exact trajectory during slow roll can 
be found by inversion. The two functions N{(p) and (j){N) for different n are plotted 
in Fig. 2.2. 
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Figure 2.2.: The number of e-foldsiV((/)) (left), the slow roll trajectory 0(A^)/topi (center) immFiq. (2.43) 
and the initial field value 0in(^tot)/TOpi (right) from Eq. (2.48) for the large field models 
with n = 2,4,6 (corresponding to the red solid, blue dashed, and green dotted curves, 
respectively) in Eq. (1.28). Note that, as is seen from the left and center plots, the same 
initial field value — S.lmpi in the n — 2 case produces the required ~ 60 e-folds , 
while iV ~ 30 and iV ~ 20 in the cases n = 4, 6 with this starting value, respectively. The 
right plot shows that one would have to choose (f>in ~ 4topi for n = A and 0in ~ 5mpi for 
n = 6 to achieve iV w 60 for these potentials. 



Characterizing the potential 



The ei parameters describe a perturbative expansion around a universe with a perfectly constant 
Hubble parameter H. Since, during slow roll, H oc F^/^, this can also be interpreted as an 
expansion around a "perfectly flat potential". While we may rely on Eqs. (2.40) and (2.41), ei 
and £2 are given by the slope and curvature of V[(j)), respectively, 



1 fV 



v'y V" 



(2.44) 



As we show below, it is the essentially the parameters ei and e2 that are constrained by observa- 
tions. Using Eq. (2.44), one may therefore learn something about the shape of the inflationary 
potential directly from CMB measurements. 

Example: ei,e2 for the large field potential 

Using the equations (2.44), we find for the large field potential Eq. (1.28) that 



,2 



2 



lovT \ cp J rupi 4^/7r 



n / mpi \ ^ 4 (if „ 1 / n 



e2 = ^C-^] =-61, = ^ - v^. (2.46) 



Note that for n < 4, €2 > ei, while 62 is the smaller parameter of the two if n 
exceeds 4. Consequently, we find (1)^2 > 4'ei (but of the same order) for n < 4, with 
(pe2 = 0ei for n = 4. Since the field moves "from right to left" in this model, it 
therefore encounters first and inflation continues very briefly after slow roll is no 
longer valid. Using Eq. (2.43), the slow roll parameters are functions of the number 
of e-folds and the initial fleld value (pin only, with 
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The parameters ei, £2 are plotted in Fig. 2.3. Note also that from combining Eq. (2.43) 
with 



, the total number of e-folds depends on the initial field value only. 
Ait 



tot 



n 



in 



mil 



n 



n 



and the required number of e-folds can be obtained if (for n 



(iVtot + 1) , (2.48) 
2) we have (j)in/mpi ^ 3. 






Figure 2.3.: Slow roll parameters ei (solid lines) and €2 (dotted lines) for the large field models of 
Eq. (1.28), using Eq. (2.47) and £2 = (4/n)ei. On the left, the case n = 2, in the center 
n = A and on the right n = 6. Note that ei and £2 coincide for n = 4, and that for n < 4 the 
second slow roll parameter is always larger than £1, i.e. the slow roll approximation breaks 
down before inflation ends. The opposite is the case for n > 4: slow roll holds in this case 
until £1 = 1. 



There exists a variety of alternative slow roll parameter sets to the one defined in Eq. (2.35), 
see, for example, [47, 48]. Very often, instead of the couple (£1,62), the two parameters (€,t/) 
are used, where £ = ei and ry = £1 — £2/2. We use the parameter hierarchy (2.35) because its 
members may be easily derived one after the other, and are all of the same order, which notably 
is important whenever higher order terms in the slow roll approximation are neglected. 



2.1.4. Exact Solutions and Primordial Perturbation Spectra 

We now use the slow roll parameters Ci to express the "mass" terms a" /a and z" /z in the mode 
equations Eqs. (2.25) and (2.29), respectively, as 



a^H"^ (2 - £1) 



a^H^ ( 2 



a^H^ ( 2 



3 1 

61 + -£2 



1 2 1 
2 £162 + 4 ^2 + 2 ^2^3 



(2.49) 



(2.50) 



where the last line contains the first order part of z" /z in the Hubble slow roll parameters only. 
Again, let us treat the simpler case of tensor perturbations first. Using the approximation (2.34) 
to express a in Eq. (2.49) (which is exact), the tensor mode equation at first order in the ei [using 
1/(1 — x) ~ 1 + X for small x] takes the form of a Bessel equation, 



1 



1 



0. 



(2.51) 



r]^ \ ^ 4^ 

Its two solutions can be written in terms of the Bessel functions J^^ and Y^^ of first and second 
kind and of order , 

3 



1,2 
V J ~ 



(2.52) 
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The complex combinations appearing in Eq. (2.52) are also referred to as the Hankel functions 
of the first and second kind or order z/^ , 



(2.53) 



We immediately see that for de Sitter space = 3/2, in which case the Bessel functions in 
Eq. (2.52) become 



kr] 



(2.54) 



Note that this solution is valid for all k. It is evident that for —krj — )• — oo, the prefactor in 
Eq. (2.54) approaches one, and we recover the sub-Hubble solution (2.30). Eventually, however, 
we are interested in those modes which crossed out of the Hubble radius during inflation and 
hence reached their long wavelength limit —kri 0. Wc already know from Eq. (2.31) that 



oc a in this case, but have yet to determine the normalization constant. 



The case of scalar perturbations is similar, but since their potential (2.50) is different from the 
tensor case, the order in the Bessel equation is modified and one finds at first order 



4' + 



k' 



1 



Vk = Q, 



2 9 3 
= I + 3ei + 2 ^2 



with the solutions 



1,2 
Vi. ~ 



V-kv {-kr]) ± iYy {-kr]) 



3 
2 



t's ~ + €i + e2 . 



1 
2 



(2.55) 



(2.56) 



We now turn to the question of the right vacuum state for perturbations: once the normalization 
for Eq. (2.56) fixed, we may calculate the exact behavior for all k. 



Choice of vacuum 



We showed earlier that the quantum treatment of perturbations has to start from the second- 
order perturbed action, which for tensor perturbations [compare Eq. (2.18)] reads 



(2.57) 



where wc have temporarily reintroduced the index o to account for the polarization states^^. Up 
to the (time-dependent) "mass" term a"/a, this is the action of a scalar field in flat Minkowski 
spacetime with (time- independent) metric g^i, = rjni,. Its special symmetry properties allow to 
define a unique vacuum state \Vt) for Minkowski space, which minimizes the energy at all times 
and is annihilated by all mode creation operators, ag|r2) = 0. Moreover, the operators and 
their adjoints (and hence the notion of "particles") are uniquely defined in this case because 
there exists a privileged set of mode functions = exp{—ikr]) / V^k in the notation of the 
expansion for tensor perturbations analogous to Eq. (2.23). 

One may still perform an expansion of the type (2.23) in a time-dependent background (as 

we implicitly did above), however, the notion of fig, at is generally no longer unique in this 
case. Notably, different choices of operators and mode Unctions Vr^j, are related by Bogolyuhov 



Recall that we have defined in a dimensionful way, = atp^l\/2K. Written in terms of the dimensionless 
original tensor hij, the action (2.57) reads ^28^'^'' = [mpi/(647r)] / d*x )' - d'{hi)] , compare 

with Eq. (2.16). 
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transformations [49], and therefore the definition of the vacuum state {^l) becomes ambiguous: 
there no longer exists a state minimizing the energy at all times. A common strategy consists 
of defining a vacuum at a fixed time rjo, which then only changes adiabatically. 
The dc Sitter background is special among time-dependent spacctimcs in the sense that one may 
define the Bunch Davies vacuum, corresponding to the vacuum state minimizing the energy for 
each mode in its infinite de Sitter past — > — oo. Therefore the corresponding mode functions 
are equal to the Minkowski mode functions for — > — oo, i.e. when they are deep inside the 
Hubble radius in their short wavelength limit. Comparing the Minkowski mode functions with 
our approximate sub-Hubble solutions (2.30), and at the same time taking into account the 
Wronskian condition, we see that we must set = I, = {) in Eq. (2.30) to recover Minkowski 
space in the infinite past. An intuitive illustration is that spacetime looks essentially fiat to 
modes k much smaller than the characteristic curvature scale (ie. the Hubbe radius) of the 
time-dependent background. 



For the exact solution (2.52) it then follows that 



V. 



-v. 



Tk 



T+y^Hi^){-kr]), (2.58) 



with the Hankel function of the first kind defined in Eq. (2.53). Using its expansion in the 
super-Hubble limit —kr) — > 0, we finally find 



2^^T- 



2k 2k^ 



r(^x) 



r(3/2)J \aH J 2P \aH 



0^777:3 77? for-krj^O, (2.59) 



where we have used i/, from Eq. (2.52). To see that Eq. (2.59) indeed approaches a constant 
(as we know it must from the approximate solution in the supcr-Hubble limit), note that the 
combination H'^ {k j aHf-'^^T indeed obeys d [H'^ {k / aHf~'^''T:] /dt = at first order in [21]. 

A similar expression is obtained for the scalar Mukhanov Sasaki variable v and the comoving 
curvature perturbation 3? = —vjz, respectively. Note that z oc ^Je\ [sec Eq. (2.32)], and that 
this time the order of the Hankel function is from Eq. (2.56), therefore we have 



,2 a' ( k 



s 

-2€l-€2 



™«2iS5Uj for-*,^0. (2.60) 

[Again, the combination [H'^ / ei){k / aH)^~'^'^s, is easily shown to be time-independent.] Since 
ei < 1 during inflation, the amplitude of scalar perturbations with wavenumbers k that became 
super-Hubble before the end of inflation is enhanced with respect to tensors. Note also that for 
tensor perturbations, the original variable was hij = (p^ e*^, i.e. one must be careful to count 
both polarization states. 



Primordial spectra 

The two-point equal time correlation function for tensor fluctuations in the vacuum, 

{^\h,,{n,x)h^^{nJm = r ^ sin(y-^l) ^3p^(^) ^ (2.61) 

defines the tensor spectrum (recall that v^^ has dimensions of a scalar field) 



16 A; 



2 



k-'r^ik) = ^ . (2.62) 

Trmpi 
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An exactly analogous definition to Eq. (2.61) for the comoving curvature perturbation J{ allows 
to write the scalar spectrum as 



87r2 



(2.63) 



After expanding the prcfactors 2^^!, r(3/2 + ei)/r(3/2) and {k/aHy^^^ in Eq. (2.59) [and 
analogously in Eq. (2.60)] for small €1,62 [2], one obtains the spectra at first order in the slow 
roll parameters: 



vrernpi 



1 - 2(C + l)ei - Ce2 - (2ei + €2) In 



k 



1 - 2(C + l)ei - 2ei In 



(2.64) 
(2.65) 



By C we denote here a constant that arises from the prefactor expansion, C = 7e + ln2 — 2, 
with 7e ^ 0.5772 the Euler constant and hence C ~ —0.73. The scale k* is called the pivot 
scale: it is picked e.g. in the middle of the observational window and used to compare the above 
expressions to observations. The choice of a pivot point singles out the moment in time when k^, 
left the Hubble horizon, /c* = a{r]^) H [r]^) , at which the zeroth order (in slow roll parameters) 
amplitude is calculated, hence the subscripts "*" in Eqs. (2.64) and (2.65). A suitable pivot 
scale is, for example, k* = 0.01 /t Mpc~^ with h the reduced Hubble parameter in Eq. (1.20). 
Note that the amplitude ratio V^/V^^ at first order in slow roll is Vj./V^ = 16ei, i.e. as stated 
above, tensor modes are suppressed by a factor ei. 



Spectra like Eqs. (2.64) and (2.65) are characterized by their indices defined as 



ns - 1 = 



din A; 



= -2ei - 62, 



d (k^V^) 
d In fc 



= -2ei , 



(2.66) 



where we have suppressed the index "*" on the right hand side indicating that the slow roll 
parameters should be evaluated when /c* left the Hubble radius. Note that these can also simply 
be read off Eqs. (2.64) and (2.65) if we suppose that the spectra (2.62) and (2.63) are of power 
law form, i.e.V^{k) oc k'^T and Vg{k) oc k'^s~^. The cases where rig = 1 and = 0, 
respectively, correspond to a scale invariant (flat) Harrison Zel'dovich spectrum. It follows from 
Eq. (2.66) that during slow roll inflation, i.e. ei,e2 <C 1, both tensor and scalar perturbation 
spectra are almost flat. Note that the tensor index and the amplitude ratio r = Vj./Vg are 
related by the "consistency relation" r = — 8n^. 

Hence, at first order in the Hubble slow roll parameters, there are four observables, the ampli- 
tudes and Arj, and their indices ng,n^. Of these, at present ^Ig and have been measured 
(see Section 2.2.5), and there are upper bounds on and n^. We discuss these results and 
how they are obtained from CMB data below. It is hoped that future experiments like Planck 
[18] will detect gravitational waves, which notably allows a test of the consistency relation. 

Pushing even further, one may also define the running , of the spectral indices n„ and 



with wavenumber k from 

dn, 



dln/c 



-2eie2 — 6263 , 



dn^ 
dh^k 



-2ei€2 . 



(2.67) 



The therefore of second order in the slow roll parameters, i.e. very small, which 

emphasizes how close to scale invariance the inflationary power spectra of Eqs. (2.64) and (2.65) 
are. (There also exists a second order consistency relation relating r, and [2].) When 
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measured at the scale A;*, one may develop the spectral amplitude of scalar perturbations (and 
analogously for tensors) around k* according to 

lnyi^(A;) = \nAlih) + (n^ - 1) In ^ + ^ In^ A + . . . . (2.68) 

Finally note from Eqs. (2.64) and (2.65) that oc i?^, while oc H'^ jex. It is therefore from 
the tensor amplitude that one may hope to uniquely fix the energy scale Rj mpi during inflation, 
while scalar perturbations only determine the ratio between that scale and ^Jt{. 



Observable window of scales 



The choice of a pivot scale fc^, defines a new measure of time by singling out the Hubble crossing 
/c*/[a(iV*) i7(A^=f)] = 1 of a given scale. In the same way, each scale k is uniquely related to 
the e-foldA^fc when it becomes super-Hubble, and we know that for observable scales today 
-^tot — ATjfc Ri 40 — 60 [50] (recall that we count e-folds from the beginning of inflation onwards) , 
where A^tot is the total expansion achieved while the inflaton field (\) moves from to i^end- To 
obtain predictions for the primordial spectra (2.64) and (2.65) from a given inflationary model 
[i.e. a choice of ^(0)] at today's observed />;, one must complete the following steps. 

Firstly, integrate the (background) system of coupled differential equations for a FLRW universe 
filled by a scalar field with potential V^(^A), given by Eqs. (1.6) and (1.25). It is usually most 
convenient to integrate the Klein Gordon equation written in terms of e-folds , 

,+ (3;.= + ^^?^)_l, + ^,.0. ,2.69) 



dA^2 

Note that the friction term may be written as Sif^ + {dH/AN)/H = H'^{^- ei). Moreover, 
numerical integration codes often use a vector of two variables (^, </)) , whose evolution is coupled 
as d(f)/dN = (j)/H and d(j)/dN from Eq. (2.69). The choice of the initial field value at t = is 
made such that (at least) enough inflation is obtained to solve the SBBM problems. A good 
estimate for this is obtained using slow roll calculations [such as Eq. (2.48) for the large field case] 
because (in standard scenarios) most of the inflationary expansion takes place in this regime. 
After integrating 0, the time evolution of H and its derivatives is known. 

Secondly, since the background integration provides us with exact knowledge of the e^, we can 
hence determine when inflation ends, ei{tei) = 1, and when the slow roll approximation breaks 
down [as soon as one ei(t^^) = 1]. While <C 1, the slow roll trajectory N{(j)) [which, again for 
the case of a large field potential, was calculated in Eq. (2.43)] should coincide with (f) obtained 
from the exact numerical integration. 

Thirdly, one can now integrate the perturbations' equations (2.25) and (2.29), where the "mass" 
terms (2.49) and (2.50) are functions of the as well as of a and H already determined from 
the background integration. [The range of k to integrate is determined from their Hubble exit, 
k = a{Nk)H{Nk), occurring around the "right time", i.e. 40 — 60 e-folds before 0e^.] Initial 
conditions on Vk-,v^^ and their derivatives are imposed such that each k mode is in its adiabatic 
vacuum deep inside the Hubble radius. 

Finally, from the knowledge of Vk^v^^, and their corresponding N^, one can directly calculate 
the spectral amplitudes and indices on large scales. We know from the slow roll solutions to 
Eqs. (2.25) and (2.29) and their limits in the super-Hubble regime that the perturbations cease 
to oscillate once outside the Hubble radius^^. 



^^More precisely, the spectra approach their late-time constant value some time after their Hubble crossing. Using 
Vk,Vrj,^, exactly at Hubble exit introduces a factor 0(2) [21]. 
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At this point, let us recall that, as we mentioned when considering the horizon problem, at the 
time of decoupling the Hubble radius i7~^(tdec) corresponded to directions within ~ 1° of angular 
separation in the sky. As a consequence, only these k have been affected by the cosmological 
evolution between the end of inflation and decoupling, and the smaller the scale, the longer the 
time period it spent under the evolution of gravitational instability inside the Hubble radius. 
At angles > 1°, we always probe larger scales than the Hubble length at decoupling, which have 
retained the primordial form of their spectra [30] . 

Example: slow roll power spectrum and index for large field inflation 

For large field potentials of the form (1.28), we obtained the trajectory (2.43), which 
gives the time dependence of the slow roll Hubble parameter when inserted into 
Eq. (2.40). If we use Eq. (2.48) to replace (t)^ in Eq. (2.47), and since €2 = (4/n) ei 
during slow roll large field inflation, we finally find for the scalar spectral amplitude 
yig and ng from Eqs. (2.64) and (2.66): 

For the ratio r = V^/V^ it follows that r = 4n/(ANk + 1). In these expressions, 
one then varies AA^^^ = N^ot — ATj. = 40 . . . 60 for observable scales. Note that from 
Eq. (2.71) it follows that the scalar power spectrum always has a red tilt, rig — 1 < 0, 
in large field models. For the simple case of a pure mass term V{(f)) = (m^/2) 
one obtains from Eq. (2.70) and (2.71) that 



-1 [-0.049,-0.033], r « [0.132,0.196] , (2.72) 

A, = ±^iAN, + lf = ^^^^-^, (2.73) 
3tt mi, Stt mi, [n„ — IV 



where the numerical values in Eq. (2.72) are for AN^ = 40 . . . 60, respectively. Us- 
ing Eq. (2.73) one can therefore observationally determine the mass scale m, if the 
amplitude of the scalar power spectrum is measured. 



Hence, the primordial spectra (2.65) and (2.64) of scalar and tensor perturbations during in- 
flation can be calculated either numerically, or in certain cases analytically in their slow roll 
approximation. They then have to be propagated through reheating and the radiation dom- 
inated epoch until the release of the CMB. From the surface of last scattering to today, the 
perturbations are then subject to the integrated Sachs Wolfe effect explained below, before they 
can eventually be compared with temperature fluctuation measurements taken today. We now 
briefly comment on the transition from quantum to classical perturbations before we turn to the 
issue of relating primordial spectra to CBM observations. 



Transition to classical perturbations, non-Gaussianity 



The continuous production of perturbations around an inflating FLRW background is a decid- 
edly quantum phenomenon: any classical perturbations are diluted away during inflation, while 
perturbations of quantum origin can develop a non-zero super-Hubble amplitude. On the other 
hand, it is evident that they must turn classical eventually because their observational conse- 
quences such as the CMB spectra and galaxy distributions do not have quantum mechanical 
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features. From the experimental point of view, the perturbations are turned classical by replac- 
ing the standard quantum average in the temperature two-point correlation function by a spatial 
average over the celestial sphere [51]. The difference between these averaging procedures gives 
rise to "cosmic variance" , as we discuss in the next Section. The details process of the classical- 
ization process of the perturbations are much more involved [52, 53]. In summary, the statistical 
properties of quantum fluctuations during inflation are interpreted as the statistical behavior of 
a classical random field after inflation. In particular, the perturbation spectra at linear order 
preserve their Gaussian character (which is due to their quantum origin, and the conservation of 
Gaussianity by linear evolution) , and hence a two-point correlation function suffices to describe 
them. Small non-Gaussian corrections can be accounted for by the three-point function, and 
may be of primordial origin {e.g. due to features in the potential leading to sudden changes of 
V and its derivatives), second order perturbation effects or non- linear evolution ("secondary" 
non-Gaussianity) . 



2.2. From Inflation to CMB Observations 

We showed that once a choice of potential V{(j)) for the inflaton is made, the spectra of both 
tensor and scalar perturbations are straightforward to calculate. We also know that the scales 
observable today left the Hubble radius about 40 — 60 e-folds before the end of inflation, and 
re-entered the comoving Hubble radius at some point afterwards during the SBBM evolution, 
which is when they became observable to us. The missing piece in this puzzle is a way to 
relate (i.e. transfer) the primordial V^(k),Vg{k) to today's density perturbation spectra Vp(k) 
obtained from observations of the CMB and large scale structure. Schematically, one may write 
this as 

Vp{k,to) = [Vpo{k)],^t^^^ T(/c, W,io) (2.74) 

where tend is the end of inflation, the index "po" stands for "primordial" and to is today. The 
function T( A;, tend, to) is called the transfer function, and in principle, it depends on all the 
physics that occurred in the Universe ever since the end of inflation. Luckily, our understanding 
of the SBBM allows to narrow its description down to a few parameters, and the calculation of 
7{k,tend,to) can be broken up into several pieces: from the end of inflation until the release of 
the CMB photons, perturbations have evolved through reheating, the radiation dominated and 
the beginning of the matter dominated epoch. Prom the time of decoupling until they deposit 
information about the primordial spectra in our detectors, these photons were essentially free- 
streaming, their frequencies being redshifted by the (decelerated) universal expansion. The 
tiny inhomogeneities in their temperature are due to the Sachs Wolfe effect [54] we discuss 
below. Though complicated and involved, the physical processes during the SBBM evolution 
are well understood, and the description of the reheating phase terminating inflation can also 
be significantly simplified. From a practical point of view, the calculation of CMB observables 
is automated to a large extent today, and computer codes (e.^. the CAMB code^^) are available 
from which the CMB spectrum corresponding to the primordial input of a given inflationary 
model is readily obtained. 



'http : / / camb . info/ 
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2.2.1. The Sachs Wolfe Effect 

The link between primordial cosmological fluctuations and temperature fluctuations on the last 
scattering surface is provided by the Sachs Wolfe effect [54], which allows the calculation of a 

photon's energy change while propagating from its release at the time of decoupling tjcc to a 
present-day detector. This change can be expressed in terms of the ratio of its temperatures 
(energies) at detection (index "d" , today) and emission (index "e" , at the time of last scattering) 
Td/Tg. At zeroth order, one simply has = [a(tdec)/ao] ^e. At flrst order in perturbations, 
photons can pick up a delay St^cc depending on their position because they have to climb out 
of gravitational "potential wells" of different depth on the surface of last scattering. Hence 
they acquire a slightly different redshift, leading to fluctuations in their temperature dT^. The 
depths of these wells in turn is a consequence of the inflationary perturbations propagated 
through reheating and the radiation dominated epoch. 

In calculating the Sachs Wolfe effect, one establishes 5tdcc(^dcc) -^c); which requires the definition 
of a surface of emission^ \ on which is the position of the photon. Then it can be shown (see 
e.g. [55]) that the relative temperature fluctuation today (we now drop the detection index "d") 
consists of three contributions, 

The first term is the dipole caused by the relative motion of our galaxy with respect to the 
reference frame of the CMB^^. The terms "S" and "T" are perturbations of scalar and tensor 

type, respectively, and they are directly related to the corresponding fluctuations of inflationary 
origin. Exact calculations show that (ST/T)^ contains three different terms [8]: i) a Doppler- 
like term (photon velocity at the time of decoupling), ii) an integral over the change in the 
gravitational potential along the photon's trajectory from tjec to to (the so-called integrated 
Sachs Wolfe effect) and in) the photon density contrast Sp^^/p^ and the gravitational (Bardeen) 
potential $ evaluated at tdec- On large scales, only the third contribution persists and its two 
parts can be combined to give 

' st\ 1 

~ -$(tdec,^e) • (2.76) 

During inflation, we previously derived the evolution equation Eq. (2.13) for $ from the Ein- 
stein equations. For the purposes of quantization, $ was replaced by the comoving curvature 
perturbation Jl, whose inflationary power spectrum is given by Eq. (2.64). Hence, if we find a 
way to propagate this spectrum from the time inflation ends at tend to decoupling, Eq. (2.76) 
relates it to the observed CMB temperature fluctuations. 



2.2.2. Conserved Quantity 

A general consideration of the Einstein equations in a universe filled with a fluid obeying p = w p 
(where w does not have to be constant) allows us to identify a convenient tracing variable for 
perturbations [55, 9, 8], commonly denoted by C, where 

2 $' + 7^$ 



'^''One possibility is the surface of constant photon density, i.e. p-yltdec) = const., in which case St^ec is proportional 

to the density contrast of photons 5p-f/p-y. 
^^The amplitude of the dipole component is (5T/T)dipoie w 1-2 • 10"^[8]. 
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This can be understood as the first integral of Eq. (2.13), and one can show that ^ fulfills 

3n 1 + w 

The physical interpretation of ( is the curvature perturbation on uniform density hypersurfaces. 
Prom Eq. (2.78), two very important conclusions can be drawn: firstly, after Fourier decomposi- 
tion and in cosmic time t, it follows from Eq. (2.78) that 1? j {a^H'^) and therefore on scales 
large compared to the comoving Hubble radius, is conserved. Secondly, from the general 
gauge-invariant definition of the comoving curvature perturbation Oi [43] [which we used in its 
longitudinal, single field form in Eq. (2.15)], it can be shown that the difference A = —(^ — 
obeys A oc k"^ /{a^H"^) as well, and hence, on large scales, they coincide, with "31 ~ —C,. 
Recall that the scalar power spectrum (2.64) is essentially given by and approaches a con- 

stant for scales outside the Hubble radius. Therefore, the relation "Rk, —C, provides the "missing 
link" to connect inflationary perturbations to a universe where the equation of state parameter 
is w. In particular, this renders the calculation of spectra independent of our microphysical 
understanding (or lack thereof) of reheating: we can simply model the transfer of energy from 
the inflaton field to relativistic particles by an effective (time-dependent) equation of state w^^ 
which smoothly interpolates between the end of inflation (with an energy density Pend) and the 
radiation dominated epoch where Wrad = 1/3- Finally, we can use Eq. (2.76) to establish the 
connection to today's temperature fluctuations in the CMB. 



2.2.3. Multipole Moments 

From an observational point of view, we measure the relative excess in temperature^® 5T (e) of 
a photon arriving from a direction e in the sky with respect to the average temperature of all 
CMB photons T. On the celestial sphere, these may be decomposed into spherical harmonics as 

^ = E E ^^-(^' ^) ■ (2-79) 

1=2 m=-e 

While the temperature T by itself is not an operator, the quantity (ST/T) is directly related 
to the (quantized) Bardeen potentials, therefore the ai^ in this expression should be considered 
as operators. Calculating their two-point correlation function for two directions ei and 6*2 then 
corresponds to calculating a vacuum expectation value. 



5T{ei) ST{e2) 



\ 1 °° 

n\ = —J2{'^^ + '^)CePe(.cos9), (2.80) 



n 

where the denote Legendre polynomials and 9 is the angle between ei and 6*2. Moreover, in 
Eq. (2.80) the multipole moments Q, which are independent of the index m, are used^^, where 



Ce 



- ^ E ■ (2-81) 



In statistical terminology, Eq. (2.80) denotes an ensemble average. Since we can only perform 
measurements in one realization of the Universe, the resulting Q are plagued by the so-called 
"cosmic variance" , 

dC, 1 



(2.82) 



'^^We consider only the scalar contribution to 6T/T in the following [see Eq. (2.75)], but drop the index "S" 
^'^Note that the dipole component of 5T/T due to the proper motion of our galaxy has been separated off. 
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which is small for large i (i.e. on small scales, for which we can take many measurement on our 
one celestial sphere), but important for the lowest multipole moments. Therefore, after choosing 
the inflaton potential V{(f)) and calculating the corresponding primordial tensor and scalar power 
spectra Vrj, (k) and Vg (k) numerically, these spectra are propagated through the reheating phase 
with an effective equation of state parameter WeS- At the end of reheating, they are carried 
through the radiation and part of the matter dominated epoch and translated into temperature 
anisotropies ST/T of the CMB photons via the Sachs Wolfe effect. A present-day measurement 
of ST/T is then expanded over the celestial sphere to obtain the multipole moments C^. The 
largest observed scale corresponds to C2 (180° separation of the photon arrival directions on the 
sky) , while the smallest scale (hence largest moment £) is determined by the resolution of the 
experiment. 

2.2.4. Polarisation of the CMB 

At this point, let us make a few comments about polarization of the CMB, i.e. an observable 
"vector field" on the celestial sphere. Prior to decoupling, the frequently scattering photons 
in the Universe are unpolarized. The process of recombination, however, is not instantaneous, 
therefore a quadrupole anisotropy is gradually produced by both scalar and tensor perturbations 
[1, 8]. In the presence of a quadrupole, Thomson scattering of photons off electrons then leads 
to linearly polarized radiation on the scales of multipoles with £ > 100, or about ~ 1° on the 
celestial sphere, since this is the Hubble size at the time of recombination. (Later in the history 
of the Universe, during the epoch of reionization, some more Thomson scattering between free 
electrons and photons can occur.) 

On top of their temperature T, polarized CMB photons require two additional parameters to 

describe them, typically called the E and B modes. (If circular polarization were also present, 
a third parameter would be required, but circular polarization is not excited by Thomson scat- 
tering.) Given that the cross-correlations {TB) and {EB) are forbidden by parity invariance [8], 
there are in principle four measurable spectra, (TT) , (EE) , (BB) and (TE). Above, we have 
only considered the (TT) spectrum, of which the WMAP satellite has provided a measurement 
of imprccedcntcd accuracy (see below). Though not designed to observe polarization, WMAP 
also put upper bounds on the {TE) component. From the Planck satellite, a more detailed 
polarization map of the sky is expected, and notably one may hope for a detection of the B 
mode: since the (BB) spectrum is only sourced from tensor (but not from scalar) perturbations, 
this would correspond to observational evidence for primordial gravitational waves. 

2.2.5. Experimental Results 

The most recent measurement of the CMB radiation was performed by the "Wilkinson Mi- 
crowave Anisotropy Probe" satellite, which released the results of five years of data taking in 
March 2008 [7, 56, 57, 58, 59, 60, 61]. Their best map of differential temperature fluctuations in 
the sky after removal of foregrounds^^ and instrumentation noise, as well as having the galactic 
plane subtracted, is shown in Fig. 2.4. The latest data release confirmed and refined the standard 
"ACDM plus inflation" model of the Universe that was established using the one- and three-year 
results [6, 62]. For WMAP, the main scientific tool for extracting cosmological information from 
maps such as the one shown in Fig. 2.4 is the temperature two-point correlation power spectrum 

^*The largest contaminant in the foreground are point sources. Other effects that can generate or amend 
anisotropies are the Sunyaev Zel'dovic effect and gravitational lensing. 
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Figure 2.4.: Foreground-reduced map (with the galactic plane removed) of CMB temperature fluctua- 
tions in the sky based on the five-year results of the WMAP satellite. (Figure from [58]). 



(TT) plotted in terms of its (conveniently normalized) multipole moments i{£ + l)Ce/2TT over i 
(see Fig. 2.5). This plot shows two distinct features. 

Sachs Wolfe plateau For small values of £ in the range of roughly 2 < / < 20, the amplitude 
of the CiS becomes almost independent of i. This is called the Sachs Wolfe plateau. (Note 
also that at small i the effects of cosmic variance as calculated from Eq. (2.82) are largely 
dominant over any measurement errors.) Information about the amplitude and spectral index 
of the primordial inflationary spectra is encoded in the height and slope of this plateau because 
its I correspond to scales > 1° that were still outside the Hubble radius (and therefore "frozen 
in") at the time of decoupling. This part of the spectrum plotted in Fig. 2.5 is therefore not 
very sensitive to other cosmological parameters (except the cosmological constant A). 

Acoustic peaks After the Sachs Wolfe plateau, the amplitude rises towards the first of a 
series of "acoustic peaks" due to baryon acoustic oscillations in the plasma at the time of last 
scattering. These £ correspond to scales inside the Hubble radius (and therefore dependent 
on cosmological parameters) when the CMB was released. After its Hubble re-entry, a given 
perturbation scale in the tightly coupled plasma of baryons and photons develops a standing wave 
oscillation of the baryon density {i.e. a standing acoustic wave). At decoupling, this acoustic 
oscillation ends because no more Thomson scattering takes place, cutting off the pressure support 
for the density oscillation. The peaks and troughs in the CMB spectrum (see Fig. 2.5) capture the 
oscillations just before they came to an end: the first acoustic peak, for example, corresponds to 
the comoving wavelength just entering the Hubble radius, which had enough time to perform only 
one oscillation and therefore was at its maximum amplitude {i.e. not damped yet) at decoupling. 
The finite thickness of the last scattering surface eventually causes a cutoff in the peak spectrum 



36 



2.2. From Inflation to CMB Observations 



due to damping out of the anisotropies around I > 800: this effect is called Silk damping and 
describes the fact that coupling of photons to the baryon and electron plasma is still present at 
these scales but is no longer perfect. 

This structure in itself -a plateau followed by a sequence of peaks- confirms that the spectrum is 
predominantly scale invariant and adiabatic. As stated earlier, the height and slope of the Sachs 
Wolfe plateau can be interpreted in terms of the amplitude and spectral index of the primordial 
perturbation spectra. On the other hand, the position and relative height of the peaks provide a 
wealth of information about the early Universe, and the best-fit theory spectrum corresponding 
to the red line in Fig. 2.5 found from WMAP observations only is characterized by the parameter 
set {^^b/^^^^m/^^ A2,n3,r,i?'o} = {0.0227,0.131,2.41,0.961,0.089,72.4} [7, 60]. As before, h is 
the reduced Hubble parameter, related to Hq by Eq. (1.20), Qi, and arc the baryon and matter 
densities, respectively, and the spectral index of the scalar perturbations. The parameter r 
is the so-called reionization depth (where the optical depth to an event taking place at redshift 
z is the scattering probability of a photon integrated from z until today) [8]. The amplitude of 
the scalar perturbation spectrum at leading order here is denoted by = k^V^/ {2'k'^). This 
minimal set of six ACDM parameters characterizes a flat Universe dominated by a cosmological 
constant with adiabatic and nearly scale invariant Gaussian fluctuations. The sensitivity of the 
peaks and troughs to various cosmological parameter is discussed in detail e.g.ui [1, 8], and 
many details on how the WMAP5 values are obtained arc given in [7, 59]. These references also 
show how more stringent bounds can be placed on the parameters, if more datasets than just 
the WMAP results are included. 

Tensor perturbations in principle show the same structure in the multipole moments [8], but 
their amplitude falls off rapidly for £ > 60 and they are therefore largely subdominant in the 
region of the acoustic peaks. They do, however, contribute to the amplitude of the Sachs Wolfe 
plateau, i.e. they add power to the spectrum on smaller scales I < 60. Therefore, their presence 
is degenerate with a red-tilted spectrum < 1, equally leading to an excess of power for small 
i. As an example, the likelihood distribution obtained for the scalar spectral index is plotted 
in Fig. 2.6, and its degeneracy with the tensor to scalar ratio r (which, in single field inflation, 
is r = — 8ei) is shown. The quantities r and can be calculated numerically for a given 
inflationary model, and one may characterize an inflaton potential V{<l)) by its "coordinates" 
(ng,r) in parameter space. For models of the large field type, this is done in Fig. 2.7 (see the 
caption of that figure for a discussion). Similar plots for other generic classes of models using 
the WMAP5 data can be found e.c^.in [59, 63]. 

Example: scalar spectrum normalization in chaotic inflation 

For simple models such as the chaotic potential V{(f)) = {m? /2)(jP' with only one 
free parameter, i. e. the mass m, one can use the slow roll approximation to establish 
the link with observations "by hand" via the WMAP quadrupole (^rms-Ps/^! since 
fc^'Pg oc (Qrms-Ps/^)^- Taking into account numerical prefactors [like e.g. the factor 
1/3 in Eq. (2.76)], using the slow roll spectral amplitude (2.64) at leading order and 
the Hubble parameter from Eq. (2.40), one can establish that [64] 

^.^%^^, (2.83) 



where Vk and e^^ ' are the potential and the first parameter evaluated when the 
observable scale k left the Hubbe radius. Using the slow roll trajectory of Eq. (2.43) 



37 



Chapter 2. The Inflationary Toolkit 



6000 



4000 



2000 




100 200 400 

Multipole moment / 

Figure 2.5.: Temperature angular power spectrum obtained from WMAP5. Grey dots are unbinned 
data, wliile the black points are the binned data with la error bars calculated from both 
noise and cosmic variance. (At low multipoles, cosmic variance is largely dominant.) The 
solid red line is the WMAP-only best fit ACDM model with the parameters given in the 
text. (Figure from [7]) 



for n = 2, one finds 



m 



2 /ic;^ n2 



45vr Qrms-PS 



With the measured value Qrms-ps/^ ~ 6 x 10 ^ [58] and for AA^^ = 50, this deter- 
mines the mass of the chaotic inflaton as m/m^i ~ 1.4 x 10~^. 



2.3. Inflationary Predictions 



Above, we explored the consequences of an early inflationary era in the Universe at both the 
background and the perturbation level. We argued that while the microphysics at play during 
inflation and at the transition to the radiation dominated epoch (reheating) are complicated 
and largely unknown, they can be expressed in terms of a few generic parameters, which allow 
one to broadly distinguish different classes of scenarios. At the top level, we may isolate some 
cast-iron predictions of inflation independently of its concrete realizations. 

Spatially flat homogeneous and isotropic Universe 

Inflation dilutes away all previous classical inhomogeneities in the observable patch, i.e. "our" 
Hubble volume. Therefore, the observable Universe must be described by a FLRW metric. 
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Figure 2.6.: Left: WMAP5 constraint (blue) on the scalar spectral index from the marginalized one- 
dimensional probability distribution. The grey line is the previous result from WMAP3. 
Center and right: Two-dimensional marginalized constraints on r and obtained from 
WMAP5. The darker shaded region is the 95% confidence level region, the lighter shading 
shows 68% confidence level. (The grey lines show the previous regions from WMAP3.) The 
pivot scale here is chosen at k^, = 0.002/ Mpc. In the central plot, no running of the 
spectral index is assumed, while the right plot allows for a non-zero a^. (Figures from [7]) 



Though inflation only last for a tiny fraction of a second, the violent exponential expansion 
renders the Universe so close to spatial flatness that all of SBBM evolution cannot push it away 
again. 

Almost scale invariant scalar and tensor power spectrum 

The spectrum of both scalar and tensor perturbations may be calculated from inflation, and is 
found to be very close to the scale invariant Harrison Zel'dovich case. The scalar perturbations 
couple to matter and are hence observable as temperature fluctuations in the CMB. Tensor 
perturbations (decoupled from matter at linear order) lead to a background of gravitational 
waves, whose amplitude is suppressed and therefore difficult to detect. If it were observed, 
however, it would allow a direct determination of the energy scale of inflation. 

Almost Gaussian perturbations 

The statistics of the perturbations are, at linear order, unaffected by the expansion that occurs 
after their generation. Since their spectra are calculated from expectation values taken in the 
vacuum inflationary perturbations obey Gaussian statistics, while small deviations from 
Gaussianity may be induced e.g. by features in the inflaton potential, non-canonical dynamics 
and backreaction. 

Adiabatic, coherent perturbations 

Inflation (with a single scalar field) predicts that the perturbations are adiabatic^^ and generated 
in a coherent manner. Though a lot of complicated particle physics enters into the calculation 
of the observable CMB spectra from the primordial inflationary perturbations, coherence is 
important to obtain the "acoustic peak" features of the CMB multipole moments Ce. This is a 
crucial difference between perturbations generated by a scalar field like the inflaton, and those 
from residual defects such as cosmic strings because in the latter case, coherence is absent and 
hence no peak structure in the CMB produced. 

Observations so far lend ample justification to the hypothesis of an early inflationary phase. Since 
the ej constrained from CMB measurements are a calculational output of a concrete scenario of 
inflation [see Eq. (2.44)], they put limits on the shape of the inflaton field's interactions. To some 

^^We discuss multifield inflation and perturbations of isocurvature type in Chapter 4. 
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Figure 2.7.: Left: Localization of various inflaton potentials of the chaotic type in the (n^, r) parameter 
plane (see legend for the identification of the symbols). The horizontal axis is the spectral 
index n^, with a scaling as in Fig. 2.6. The darker shaded region is the WMAP5 95% 
confidence level contour, while the lighter shading indicates the 68% confidence level region. 
Note that the simple V{(j)) — {m? /2)(f>'^ model (white circles) still fits the data very well, 
while the quartic potential oc is well outside the 68% contour. (Figure from [59]) Right: 
Though not originally designed as a polarimeter, WMAP has been able to measure the cross 
polarization spectrum (TE), plotted here in terms of its multipole moments Cj^. Again, a 
peak-like structure is obtained; note that the vertical axis here is {I + l)Cj^/(27r). (Figure 
from [60]) 



extent, this is already possible with today's WMAP5 data (see Section 2.2.5), and the Planck 
mission will provide much more stringent bounds. Therefore, if we can derive these interactions 
from a theoretical framework, as we discuss in the following Chapter, observations may be able 
to discriminate between them. 



2.4. Beyond the Standard Treatment 

In this Section, we discuss two aspects of inflation related to physics at the smallest and the 
largest scales, respectively. From a conceptional point of view, these issues have to be ad- 
dressed within the inflationary framework, though their consequences may not be accessible to 
us experimentally. 

2.4.1. Physics of the Very Small: Trans-Planckian Effects 

We previously mentioned that the minimum amount of inflation is bounded by SBBM problem 
solutions. Most models, however, predict a number of e-folds much bigger than this requirement. 
In terms of the mode labels k we introduced when studying perturbations, the number of e- 
folds measures the "stretching" of a given k (or of the wavelength A, respectively), since the 
physical length associated with it is proportional to the scale factor, Aphys = a A. Scales observed 
today were smaller in the past and one finds that, if the amount of inflationary expansion is 
large, certain modes k should have originated below the Planck scale ipy. We know, however, 
that at the Planck scale at the very latest we must touch onto the realm of new physics. 
The issue of trans-Planckian physics in inflation^'' was first raised in [68, 69] and one may study 

■^"Previously, similar questions had been studied in tlie context of black hole radiation [65, 66, 67]: following a 
black hole photon back into its past closer and closer to the event horizon, its wavelength undergoes a blueshift. 
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it from two different vantage points: on the one hand, one must hope that the trans-Planckian 
past of observable scales does not affect the predictions of inflation too severely because these 
predictions were derived using General Relativity and quantum field theory, both of which must 
be cast into doubt close to the Planck scale. In this sense, one is interested in the robustness of 
inflation's predictions to trans-Planckian effects [70, 71]. On the other hand, the prospect that, 
via the exponential expansion of the Universe, these high energy scales may lie within the grasp 
of observations is tantalizing. Inflation can then be understood as a "Planck scale microscope" , 
and the more wc know about physics close to the Planck scale, the more concrete predictions 
we can test using precision cosmological measurements. 

Inspired by the strategy pursued for a similar black hole problem, it was argued that the dis- 
persion relation in the perturbations' equation of motion Eq. (2.25) may be modified in order to 
mimic trans-Planckian effects [72, 73, 74, 75]. This is reminiscent of condensed matter physics 
where, at wavelengths comparable to the lattice spacing in crystals, the dispersion relation de- 
parts from the linear relation w^, oc k. Hence, in the inflationary context, if a mode's wavelength 
is comparable to the "spacing of spacetime" at the Planck scale, it may experience effects of 
this discretization. Therefore, at short distances, the time-dependent frequency of the harmonic 
oscillator of Eq. (2.25) changes, and one replaces k by an effective wavenumber keS- 
This modification, however, is engineered by hand and hence lacks fundamental justification. 
Starting from first principles assumptions about a theory of quantum gravity, one may therefore 
take a different approach [76, 77, 78, 79, 80]: a finite minimum length (corresponding to the 
discretization length of spacetime) is introduced via quantum gravity correction terms to the 
commutation relations of quantized perturbation theory. Such corrections arise from a high 
energy modified Heisenberg uncertainty relation, AxAp > | [l + /3(Ap)^ -|- . . . ] . Here /3 is a 
positive constant related to the "spacetime spacing" Axmin by Axmin = V^- 
Independent of the framework chosen to describe them, once a scale k has grown sufficiently for 
its mode function to reach the standard form of Eq. (2.25), the trans-Planckian effects manifest 
themselves as a vaccum choice different from the usual Eq. (2.58). Traces of the other (decay- 
ing) branch of the two-dimensional solution space of Eq. (2.25) should persist, meaning that 
the mode is not in the perfectly adiabatic vacuum in the far past. In the CMB spectrum, this 
non-adiabaticity of the vacuum takes the form of super-imposed oscillations, which are already 
tightly constrained [81, 82]. 

2.4.2. Physics of the Very Large: Eternal Inflation 

The observation of homogeneity and isotropy on scales 0(~ 100 Mpc) justifies the use of the 
FLRW metric to describe our observable patch of the Universe, which is 0(~ 3000 Mpc). How- 
ever, we have no means of knowing if the Universe remains homogeneous and isotropic on even 
larger scales: it is possible that globally, spacetime is highly inhomogeneous and made of distinct 
"Hubble bubbles" , each corresponding to a separate observational patch with possibly different 
laws of physics (see Fig. 2.8) [83, 84]. This immediately raises new questions: how did this 
global structure arise, and how can we develop a measure of probability to account for it, given 
that we are trapped within our own bubble? 

Quantum fluctuations of the inflaton field 6(j) were previously treated as small corrections to the 
homogeneous background (po. However, the typical size of quantum fluctuations for a massless 
free scalar field in a de Sitter universe is given by S(f) c±. -H"/27r, and hence S<p depends on the value 

However, for the type of dispersion relations studied, it was found that any trace of high energy physics is 
erased thermodynamically from the observed photon spectrum of a black hole. 
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Figure 2.8.: Left: The eternally inflating self-reproducing Universe with its inhomogeneous structure of 
separate "Hubble bubbles" on very large scales. The different colors of the bubbles indicate 
different realizations of the laws of physics, arising from the initial field value in the bubble. 
Whenever the field fluctuates to a value where the Universe can start inflating in the slow 
roll regime, a new bubble begins to form. Right: Sketch of a typical large field ("chaotic") 
inflation potential with the regions of small and large quantum fluctuations. While the field 
value is big enough, the field can move upwards on the potential through quantum jumps, 
in the opposite direction than its slow roll classical trajectory dictates. (Figures from [85]) 



of the Hubble parameter, which in turn is a function of the background scalar field value (pQ. 
During slow roll inflation, H is given by the potential only, see Eq. (2.40). Therefore, depending 
on where (po is located on its potential, quantum fluctuations can have different amplitude. For 
illustration, let us assume a potential of large field type, z.e. the inflaton starts at a large field 
value (pin (in Planck units) and rolls towards smaller ones. Therefore, the classical field motion 
described by Eq. (2.41) is "to the left" (see Fig. 1.5), and during a time interval At, the field 
typically moves by [see Eq. (2.41)] 



A4> 



XL 

3H 



At 



V 



(2.85) 



where we used one Hubble time for the time interval At along with the slow roll expression 
Eq. (2.40). This classical movement "to the left" has to be compared with the quantum jumps 
54> — H/2tt that occur randomly to the right and to the left. Classical and quantum fluctuations 
are of equal amplitude if 



\A<P\ 



hence 
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(2.86) 



Using Eq. (1.28) and Eq. (2.40), we find that for the large field case this holds true for 

2/(n+2) 



3 



1/2 



n I mpi 
4 I 



(2.87) 



Therefore, in the region where (p > (p*, quantum effects can exceed the amount of classical motion 
occurring during one Hubble time, i.e.(p can largely compensate its slow rolling downwards 
Eq. (2.41) by a quantum jump upwards on the potential. Extrapolating this argument, we see 
that at the next step (given that it is still in the regime (p > i;^,,, ) , the inflaton again has a chance 
of moving upwards instead of rolling down etc. It could therefore forever stay in the region where 
quantum effects are dominant. Only once a quantum jump in the same direction as classical 
motion allows the field to reach values (p < (p^ can the slow roll evolution take over [83, 84]. 
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Recall that we previously also determined the scale (pqg [see Eq. (2.48)] where quantum gravity 
effects become important because V{(l)qg) ~ nipy For large field potentials of the form Eq. (1.28), 
it is related to (f):^ by 

rrtpi \3 J \nj \mpi J 

ie. there exists a regime where quantum jumps 5(f) are important, but still (p < (/)qg. 
Classically, the field value in large field inflation continuously decreases from ^in, which is 
bounded from below by the requirement of Ni^t = 40—60, but can be much larger (as long as the 
regime of quantum gravity is avoided). One may consider the region inside each Hubble radius 
H^^ as a separate universe [one of the bubbles in Fig. (2.8)] since it is causally disconnected 
from the rest of spacetime (which can comprise many other size regions). We can therefore 
think of the initial field value as a number "pulled out of a hat" (with a certain probablity) 
for every Hubble volume separately: those regions where (pin is at least large enough to create 
A'tot e- folds of inflation are a priori candidates for our part of the Universe. However, if (pin is 
0{(p*) as calculated in Eq. (2.87), the field may undergo an erratic sequence of jumps 6(p first 
before its value has dropped enough to start inflating according to Eq. (2.41). In this context, 
one may speak of an "eternally self-reproducing inflationary universe"' [84], since each region 
where inflation does set in immediately produces a volume large enough to fit our own Hubble 
bubble, and continuous quantum jumps ensure that there are always regions left in which 
still exceeds The consequences of this are the subject of the "stochastic inflation" approach 
[86, 87, 88, 89, 90, 91]. 

Example: stochastic efTects in chaotic inflation 

For n = 2, one can write the potential as a simple mass term T^(0) = (m^/2) 
Therefore, we have from Eqs. (2.87) and (2.88) that 

.V'V^5n)"=«10', *!i=('5|lV'7AV«io«, (2.89) 



rupi \167r/ \ m J ' mpi \ 3 / \mpi/ 

where the numerical values are for m/mpi ~ 10~^, a typical value found from normal- 
izing perturbation spectra to the CMB, see Section 2.2.5. Recall that the required 
minimum initial field value (pi^ is O(mpi) in this model, see Eq. (2.48). Clearly, if one 
were to choose (p-m ~ (</**)) the number of e- folds produced is A'tot ~ 0(lO®) [see 
Eq. (2.48)], i.e. much larger than the required amount. We also see that quantum 
fluctuations dominate the motion of the inflaton field in chaotic inflation long before 
the scale of quantum gravity. 



Stochastic inflation: quantum fluctuations interpreted as "noise" 

These considerations motivate the idea of "coarse graining" the inflaton fleld over the entire 
Universe at a scale of roughly the Hubble volume, i.e. each separate region is assigned a 
background field value (pci averaged over its volume (see Fig. 2.8), and all fiuctuations on smaller 
scales are summarily considered as "quantum noise". Technically, this is achieved by dividing 
the field into its long and short wavelength modes and "smearing out" the latter ones. The 
classical field (pci (comprising only the long wavelength modes) by definition exactly obeys the 
slow roll equation of motion Eq. (2.41). The field (p, re-interpreted as a stochastic quantity, 
instead obeys a Langevin equation, 
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where the stochastic function ^ (t) describes the quantum noise produced by the short wavelength 
modes. By H' we denote the derivative of the Hubble parameter with respect to (p (not with 
respect to conformal time rj, as it was the case for H' earlier). It is important to note, however, 
that the derivation of Eq. (2.90) is straightforward only in a de Sitter background (where the 
Hubble parameter is constant, H' = 0, and (f) corresponds to a test field in a fixed background) 
[87], and especially the normalization factor in front of ^(t) is obtained by considering jumps of a 
test field in de Sitter spacetime. It is then generalized to time-dependent backgrounds, but one 
must be careful because cf) no longer behaves like a test field, but back-reacts on the spacetime. 
The (Gaussian) noise is completely characterized by its statistical properties'^ 



It is well known [92] that a Langevin equation of the general form 



(2.91) 



dt 



where for Eq. (2.90) we have 



a{<P) = —H', P{cP) 
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(2.92) 



(2.93) 



is related to a Fokker Planck equation which describes the evolution of the probability distri- 
bution P{(t),t) for finding a given field value (f) inside a Hubble domain at time t. This Fokker 
Planck equation is written as [92] 
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where the coefficients a{(f)) and b{(l)) are related to the a{(l)),/3{^) of Eq. (2.92) by 
a{ct>) 
b{ct>) = 



a(^) + \ ^ im? 
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(2.95) 
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It should be noted that there is an ambiguity in defining the Fokker Planck equation for a given 
Langevin equation which is related to the question of whether, in the limit of continuous time, 
the coefficients in Eq. (2.92) are evaluated before or after the quantum jump has occurred [92]. 
Here, we exclusively use the Stratonovich rule for deriving the Fokker Planck equation, which 
corresponds to the usual laws of differentiation. (Alternatively, one may use the ltd rule.) The 
inflationary Langevin equation for the stochastic field (j) Eq. (2.90) hence corresponds to a Fokker 
Planck equation obeyed by the probability distribution P{^, t) which reads'' [93, 94] 
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^'^For simplicity, we only consider the case of "white Gaussian noise" here. "Colored" noise functions are related 

to more complicated window functions used for separating long and short wavelength modes. 
^^One may also write Eq. (2.98) in the form [93] 
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A stationary (i.e. equilibrium) solution of this equation for which dPeq{(p)/dt = would be of 
particular interest since the field then should be distributed according to -Peq(0) at late times 
t ^ oo. Prom Eq. (2.98) or Eq. (2.97), respectively, one can show that [95] 



Peq{4>) oc exp 
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where Eq. (2.40) has been used to replace H^. 

Let us now consider the probability that the inflaton field takes a certain value at time t under 
the condition that its value at some earlier time t' < t was e.g. equal to x (in particular for 

t' = tin and X = (f'in)- This distribution P((j),t\x) then obeys a backward Kolmogorov equaMon 
[92] and it can be shown that a stationary solution to both this backward equation and Eq. (2.98) 
is 
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This distribution cannot be normalized if the potential V{(j)) vanishes at its minimum (which, 
following its classical motion, approaches for late times t). Intuitively, one can understand 
this as the absence of quantum jumps as soon as the field reaches < 0*: it is then only subject 
to the classical force that pushes it downwards on the potential, there is no counteraction trying 
to increase (p again and therefore no stationary distribution can be established. 
The way out of this puzzle leads to the realization that Picp, t) does not account for the relative 
size of the Hubble domain that contains the value at time t: the larger 0, the larger the 
Hubble parameter H for a large field potential. Therefore, the physical probability distribution 
Pphys{<p,t) should obey [95] 
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where the extra term accounts for the growth of the corresponding volume during a given time 
interval. Again, a corresponding backward Kolmogorov equation for -Pphys(0) ^Ix) holds and it 
can be shown that in this case there indeed exists a normalizable stationary solution. 



Perturbative solution in the noise 



A different approach for obtaining the probability distribution of the inflaton field (f) once it has 
been turned into a stochastic quantity consists of using the expansion [88, 96] 

m ^ Mt) + Hi{t) + SMt) + ■■■, (2.102) 

in the Langevin equation (2.90), where 6(f)i{t) is of first order in the noise ^{t), S(j)2{t) is of second 
order etc. . A consistent expansion of the coefficients in Eq. (2.90) leads to differential equations 
for 5(t)i{t) and S4>2{t), which can be solved in terms of integrals of H and its derivatives, multiplied 
by one or two noise functions ^{t), respectively. The idea then is to exploit the properties of 
the noise (2.91) to obtain the mean values {Scpf) and {6(j)2) in terms of the classical field (pci{t) 
[which exactly solves Eq. (2.41)] only. 

To tie the perturbative approach of Eq. (2.102) to the probability distribution P(0, t) satisfying 
a Fokker Planck equation, note that the probability distribution P((/!>, t) may also be defined as 

P{cl>,t) = {6{(t>-cl>m , (2.103) 
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where (p[S,] here is the stochastic field that is expanded around in Eq. (2.102). The mean 
(. . .) in Eq. (2.103) is calculated with respect to the functional probability distribution of the 



noise, 



V[/^] =7Vo exp 



(2.104) 



where Aq is a normalization coefficient given by Aq = [T>^ exp (— ^ eC-'^)]-' and we use the 
shorthand notation f^g = J dr f{T)g{T). One can then show that the approximation Eq. (2.102) 
in the Langevin equation (2.90) gives an approximate probability distribution solution P{(j), t) 
to the Fokker Planck equation (2.98): 
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For the case of standard inflation, this distribution has been studied for different types of po- 
tentials in [88]. In this thesis, we present (Chapter ??) the generalization of the perturbative 
approach of Eq. (2.102) to the case of Langevin equations originating from an inflaton equation 
of motion with a non-canonical kinetic term. 



In this Chapter, we pushed the description of a FLRW universe filled with scalar field mat- 
ter to first order in perturbations, and calculated the resulting spectra for scalar and tensor 
perturbations. The former leave an imprint on the CMB radiation reaching us today from the 
surface of last scattering, and provide the seeds for structure formation during the SBBM epoch. 
Having firmly established benefits and predictions of infiation, we now address the question of 
the inflaton's identity. 
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Searching for an inflaton candidate in high energy extensions of the Standard 
Model^ (SM) seems promising because the energy scale of inflation is possibly close to 
the realm of Grand Unified Theories (GUTs). In this Chapter, we present a concise 
overview of attempts to embed inflation within a GUT framework, along with the 
difficulties generally encountered. 

3.1. From the Standard Model to Grand Unified Theories 
3.1.1. The Standard Model of Particle Physics 

All experiments in the laboratory to-date confirm the predictions of the Standard Model of 
particle physics. Established over the second half of the twentieth century in close interplay 
between theoretical progress and accelerator discoveries, the SM explains the strong, weak and 
electromagnetic interactions from gauge theories based on the groups S'?7(3)c x SU (2)l x U{1)y- 
Generally speaking, if a particle is charged under a certain symmetry group, it partakes in the 
corresponding interaction: S'C/(3)c gives rise to quantum chromo dynamics, describing processes 
between coloured particles (hence the subscript "C"), while SU{2)i^ x U{1)y stands for the 
unification of quantum electrodynamics and the weak interaction within the Glashow Weinberg 
Salam model [97, 98, 99] at energies of 0(200 GeV). The subscript "L" indicates that the elec- 
troweak interaction is left-handed, and "Y" stands for the weak hypercharge Y . The symmetries 
5?7(2)l X U{1)y are then broken down into the weak and electromagnetic forces observed at 
lower energies by virtue of the so-called Higgs mechanism [100, 101, 102]. 

The Standard Model matter particles, called quarks and leptons, are fermions (i.e. particles of 
half-integer spin) and can carry color (for quarks only), weak and electric charges. Forces are 
mediated by messenger particles (which are bosons, i.e. their spin is integer): eight gluons g for 
the strong force, three weak gauge bosons W^,Z^ for the weak interaction and the photon 7 
for electromagnetism. Photons are massless, which means that the electromagnetic force has 
infinite range^, and reflects the fact that a residual electromagnetic U{1) symmetry is unbroken 
after SU{2)l x U{1)y ^ U{1) cjii via the Higgs mechanism. Gluons are also massless [S'f7(3)c is 
unbroken], but the strong interaction they mediate is confined to the very short distance scales 
O(l0~^^m) inside atomic nuclei. Fermions and weak gauge bosons, on the other hand, are 
massive: their masses are generated in the process of electroweak symmetry breaking when the 
Higgs particle, the only elementary scalar (spin-0) field in the SM, acquires a non-zero vacuum 
expectation value. 

Fermion and boson families of the Standard Model of particle physics are summarized in Fig. 3.1. 
With the exception of the elusive Higgs particle, they have been detected and their properties 

'^of particle physics (as opposed to the SBBM) 

^As does gravity; we shall see in later Chapters that in string theory, gravitational interaction is mediated by a 
massless spin-2 particle called the "graviton" . This motivates the interpretation of string theory as a quantum 
gravity candidate. 
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Figure 3.1.: Left: The particle families of the Standard Model of particle physics. Each particle is 
accompanied has a corresponding anti-particle with opposite charges. (Figure by American 
Association for the Advancement of Science) Right: Running of Standard Model coupling 
constants with and without supersymmetry. ai is the coupling constant of weak hypercharge 
Y, 0L2 that of the chiral SU{2) interaction, and as the coupling of quantum chromodynamics. 
They seemingly unify around an energy scale of 0(10^5 GeV). (The horizontal energy scale 
is expressed in terms of the gauge boson mass.) The unification becomes exact when 
supersymmetry is assumed. (Figure from [104]) 



measured in high energy accelerator experiments [103]. The large discrepancy in mass between 
leptons and quarks is explained by their different coupling strength to the Higgs. In contrast to 
leptons, quarks are only observed in "colorless" combinations of three (baryons) or two (mesons), 
held together by gluons. 

The SM is a renormalizable theory, meaning that it depends on a finite number (in this case 19) 
of input parameters to be determined from experiment. Moreover, these parameters depend on 
the energy scale at which the interactions are probed: in the process of renormalization, fields 
and couplings are redefined to absorb counter terms (necessary to cancel divergent Feynman loop 
diagrams) and hence pick up a dependence on the high energy cutoff Ayy of the theory. In this 
sense, the Standard Model is an effective theory that may be used to calculate processes up to 
Auv, where it should be replaced with a more general framework. Indeed, measurements of the 
SM gauge coupling constants at current accelerator energies suggest that the electroweak and 
the strong interaction unify further around 0(lO^^ GeV) (see Fig. 3.1). This raises the question 
whether a fundamental theory can be found for all three, i.e. strong, weak and electromagnetic 
forces, much like electroweak unification is achieved by the Glashow Weinberg Salam model. 



3.1.2. Supersymmetry and Supergravity 

Great effort has been dedicated to the construction of such "Grand Unified Theories" , which 
may rely on an extended gauge group like 50(10), broken at energies below O(l0i5 GeV) to 
recover the Standard Model. An important building block of many GUTs is "supersymmetry" , 
a symmetry that extends Lorentz invariance by introducing spin-1/2 group generators. At 
the most basic level, supersymmetry predicts a "superpartner" for each SM particle, i.e. for 
each fermion in Fig. 3.1 there should be a boson with the same mass, and vice versa. In 
later Chapters, we discuss this formalism in more detail; textbook descriptions are given e.g. in 
[105, 106]. Any supersymmetric version of the SM therefore comprises at least twice its particle 
content. Immediate consequences of supersymmetry are that i) the SM gauge coupling constants 
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now unify precisely at one point (see Fig. 3.1), and ii) there are now (at least) two Higgs 
doublets^. Conceptually, it is possible to introduce not only one, but a number A/" > 1 of 
supersymmetries, which increases the particle content even further. However, to eventually 
recover the chiral character of the cicctrowcak interaction (under which left-handed particles 
transform as doublets, while the right-handed ones are singlets), is is advisable to focus on the 
J\f = I case. The simplest extension of the SM using supersymmetry is called the "Minimally 
Supersymmetric Standard Model" (MSSM). 

Note that gravity is no integral part of the Standard Model picture. Indeed, GR is fundamentally 
different from the SU{3)c x S'C/(2)l x U{1)y framework: apart from the "running" of coupling 
constants, renormalizability also imposes restrictions on the form of allowed interaction terms in 
the SM Lagrangian. Only terms with dimensionless coupling constants are permitted, limiting 
to four the number of fields participating in any given vertex. In the case of gravity, however, 
we know from the action (1.2) that the coupling constant k is dimensionful. Therefore, gravity 
has to be added "by hand" to the SM to obtain a complete description of Nature. 
A priori, supersymmetry is an extension of the Standard Model only and hence does not de- 
scribe gravity. However, when supersymmetry is turned from a global into a local symmetry (by 
making its infinitesimal Grassmann transformation parameter spacetime dependent), it includes 
gravity (i.e. invariance under general reparametrizations of spacetime). The resulting theory is 
then called "supergravity" , and in this process one passes from a potentially rcnormalizablc the- 
ory (global supersymmetry) to a non-renormalizable one with an infinite number of parameters. 
[As a consequence, non-renormalizable terms in the "superpotential" W{^), see below, may be 
allowed in supergravity.] A common strategy is to consider globally supersymmetric models as 
the limit of supergravity scenarios when nipi — t- oo, though this approach has its limitations. 
Since we do not observe an equal mass bosonic partner for e.g. the electron, even M = 1 super- 
symmetry must be broken (and the SM recovered from the MSSM) below a certain energy scale. 
In the theory's local form as supergravity, the breaking occurs spontaneously (i.e. via a non- 
zero vacuum expectation value, like electroweak symmetry breaking by the Higgs), usually in a 
"hidden sector" of unobserved particles, from where it is communicated to the "visible sector" 
of SM particles. The resulting theory in the visible sector then looks like global supersymmetry 
with explicit, so-called^ "soft" supersymmetry breaking. 

The study of inflationary scalar perturbations in the previous Chapter showed that the energy 
density during inflation can be close to the GUT scale. More precisely, the Universe could have 
been at this energy scale during the observable part (i.e. the last ~ 60 e- folds) of inflation, with 
the limit p < lO^^rripj derived from the non-detection of primordial gravitational waves [24]. 
At these scales, supersymmetry (or supergravity) sets the stage. Moreover, in contrast to the 
SM with its lone Higgs scalar, supersymmetry makes an abundance of scalar fields available for 
inflationary model building. 

3.1.3. Supersymmetry Multiplets 

Let us briefly examine the minimal fleld content of a supersymmetric model. In A/" = 1 super- 
gravity, we have at our disposal i) the chiral supermultiplet (from which supersymmetric matter 
is built) containing a Weyl spinor and a complex scalar field, ii) the vector supermultiplet (for 

^Scalar fields always come in pairs in supersymmetry (which can be written e.g. as the real and imaginary part 
of a complex scalar) since two degrees of freedom are needed for the superpartner for a spin-1/2 fleld. 

''Soft supersymmetry breaking is characterized by the fact that it gives the supersymmetric partners of quarks 
and leptons masses within the "correct" range 0(100 GeV — 1 TeV), i.e. such that they are heavy enough to 
have escaped detection until now, but light enough not to interfere with the SM Higgs mechanism. 
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supcrsymmctric gauge particles) made of a Weyl spinor and a masslcss vector (spin-1) and in) 
the gravity supermultiplet with the spin-2 graviton and its superpartner, the gravitino (spin- 
3/2). With inflation (i.e. scalar fields) in mind, we focus on the chiral supermultiplet, which we 
write as It may be expanded in "superspace" ^ in component fields [105] 

^x'',e,9) = (l){x'') + V2 9ij{x'') + 9^ Fixf") (3.1) 

+idi,<p{x^') 9a^'9 -^9^ d^iPix^") a''9 - ] d^d^'(P{x^') 9"^ 9^ , 
V 2 4 

where is a complex scalar, a spinor and F an auxiliary scalar field. (Auxiliary fields like 
F are introduced to make the supersymmetry algebra close off-shell, i.e. without use of the 
equations of motion.) In the above expansion, we have not displayed the spinor indices, and the 
denote the usual vector of (2 x 2) matrices = {I, a), where the a* are the Pauli matrices. 
The conjugate of the (left chiral) superfield $ is denoted by We discuss the form of the 
general supersymmetric Lagrangian for the superfield $(x'*, 9, 9) in both its renormalizable and 
in its supergravity version below (see Section 3.2.3). 

The two-component field can be composed e.g. into its real and imaginary part (we denote 
the complex conjugate of ^ by 0), or, alternatively, into a radial and an angular mode (much 
like the decomposition of a symmetry-breaking field like the Higgs). Both its degrees of freedom 
are natural supersymmetric candidates for the inflaton, and there can be n copies of the chiral 
multiplet of Eq. (3.1) in a supersymmetric theory, giving ^j,^ = l,...,n complex fields. It 
should then in principle be possible to derive the effective inflaton potential T^(^i) from the 
chosen supersymmetric particle physics model, e.g. the MSSM. Our strategy is the following: 
we first discuss conditions on the most general type of inflaton potential, before turning to the 
question how these can be satisfled by a supersymmetric candidate fleld. 



3.2. Inflation and Supersymmetry 

3.2.1. Conditions on a General Inflaton Potential 

In a general effective field theory, the inflaton potential with an inflnite number of terms reads 

V{4>) = V,±'^ct>''±\^'' + Y,Xn n^'--^/' r (3.2) 

n=5 

where the first (constant) term plays the role of vacuum energy, and the second term gives the 
scalar field its mass. A linear term is absent by a suitable choice of origin for the field (j). Gener- 
ically, the only renormalizable interaction term present is the quartic self-coupling. (A cubic 
term is allowed by renormalizability, but makes it necessary to check explicitly if the potential 
is positive definite.) One must have A < 1 to be in the pcrturbative regime. The couplings 
Xn of the non-renormalizable terms (in which the Planck mass appears explicitly via n) are 
generically 0(1) if the cutoff scale is Planckian Auv ~ JT^pi [or |A„| 2± (mpi/Auv)" in general]. 
Their values should be calculable within the theory that replaces the effective framework at the 
cutoff. The Xn vanish in the limit where the cutoff scale Auv — ?• oo (i.e. , with respect to the 
above discussion, when supergravity is replaced by global symmetry for mpi — )• oo). One must 
be careful when taking this limit during infiation because the coupling to gravity is essential to 

* Supersymmetric fields are functions not only of the spacetime coordinates , but also of anti-commuting 
Grassmann variables 6, 6. 
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drive spacetime expansion, and hence the terms oc A„ can be problematic when (f> > rripi. 
It is possible that the underlying theory beyond the cutoff prescribes the exact form of the po- 
tential [25] and that, for example, only the quadratic or the cubic term are present in Eq. (3.2), 
corresponding to the popular large field potentials V{c/)) = [m^ /2)cj)'^ and V{(t)) = (A/4)(/)^. If, 
however, we consider Eq. (3.2) in its full generality, we see that, discarding accidental cancel- 
lations between the non-renormalizable higher order terms (note that the A^i can have either 
sign), a canonically normalized field (f) inflates if the constant term Vq dominates [24]. This is 
usually expressed in terms of a potential slow roll parameter r]v with rjy oc V" /V ^ i.e. a mea- 
sure of curvature for the potential^. The condition \r]v\ ^ 1 for prolonged slow roll inflation 
then puts upper bounds on A and the A„ [31, 24], which are increasingly difficult to satisfy the 
larger the field value (f) is compared to mpi. One finds that, since the range of for which 
inflation takes place cannot be made arbitrarily short, the A„ are required to be unnaturally 
small, i.e. fine-tuned. 



3.2.2. Supersymmetric Lagrangians 
Chiral supersymmetric Lagrangian 

From the superspace expansion (3.1) of a superfield (and its analogue for the conjugate $|), 
one may determine the components of products like $i<&j and From the supersymmetry 

transformation properties of the former, it can be shown that any product of only $i's is again 
a (left) chiral superfield, while a combination behaves like a vector superfield (whose 

superspace expansion we have not written out here, but is readily available in the literature 
[105]). Moreover, one can exploit the integration over the Grassmann coordinates {9,9) of 
superspace, for which J d0 = 0, / d0 = 1 etc. , to "extract" components from these superfield 
expansions. The most general supersymmetric Lagrangian for chiral superfields can then be 
written as 



>Csusy = / I] + / W{^) + h.C. 



(3.3) 



where for the integrations we use the notation 6.'^9 = d^9d?'9 and d?9 = — d^d^/4, the spinor 
indices again being suppressed. The function 1^(<I>) is the superpotential we mentioned earlier 
and contains products of the superfields $i (and not their conjugates $|) only. [The reverse 
applies to its hcrmitian conjugate W'^ , indicated by "h.c." in Eq. (3.3).] By renormalizability, 
only up to cubic powers (i.e. at most a term $j<I>j<I>fc) are permitted. The integration J d^0 
then projects out the so-called "F term" of W{(^), i.e. the coefficient of 99 in the superspace 
expansion of and/or which contains the auxiliary fields Fi. 

The first term in £susy provides the kinetic terms for the component fields of the and its 
notation in Eq. (3.3) is explained as follows: in the superspace expansion for a vector superfield 
[in analogy to Eq. (3.1) for the chiral superfield], an auxiliary pseudo-scalar field appears, which is 
usually denoted as D. Since the product transforms as a vector superfield, its expansion has 
a "D term" , which multiplies the product 9999 of superspace coordinates. This term is projected 
out by the d^0 integration in Eq. (3.3). An alternative notation for the chiral supersymmetric 
Lagrangian i2susy of Eq. (3.3) is therefore 

>Csusy = E [^^^1] J, + + h.c.} . (3.4) 



^In terms of the Hubble flow parameters evaluated in the slow roll limit, tjv = 2ei — €2/2, see [47, 48]. 
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Note that in global supersymmetry, the kinetic terms of the components (pi and tpi are canonical, 
i.e. using the expansion (3.1), one finds 



D 



FjPi + ^ d''(t>i - ^ d^d'^cl^i + h.c.) + ^ {i^i a'' d^^i + h.c.) . (3.5) 



Chiral supergravity Lagrangian 

When supersymmetry is made local, the supersymmetric Lagrangian of Eq. (3.3) generalizes to 



--sugra 



(3.6) 



where 3C($''', is a general function of the superfields and their conjugates, and therefore non- 
minimal kinetic terms for the component fields are possible. Since the theory now includes 
gravity and is not renormalizable anyway, there is also no longer a restriction on the form of the 
superpotential W{^). 

It turns out that the scalar component fields of chiral multiplets only enter into the 

Lagrangian via a specific combination. 



+ log|t^|^ K{ 



-31og(-X/3) , 



(3.7) 



where K((p, (f)) is often called the Kdhler potential, and W and % are the superpotential VF($) 
and the general function 3C($^^,$) of Eq. (3.6), with the superfields replaced by their scalar 
components. For the purpose of inflation, our main interest is the bosonic part >Cb of supergravity 
Lagrangians like Eq. (3.6). Up to a metric determinant, and in units where we set k = 1 for the 
moment, £5 can be obtained using the function G((/>, ^) only. 



C^,<x-R + G'^D^4>iDi'4P + e 



G 



G, (G-^) -3 



(3.8) 



where R is the Ricci scalar, i.e. the gravity part of the theory. The indices for G indicate 
derivatives with G^ = dG/df, G' = dG/dcpi, and also G'j = d'^G/{d(j)id^3). By G'^ we then 
denote the inverse matrix of second derivatives. The second term in Eq. (3.8) then provides 
the kinetic terms for the scalar fields (p, (/). Note that is a function of the (f) only (not the 
(p), therefore it docs not contribute to the second derivative G*^, and one may write G*y = 
-fC'j. Consequently, a minimal kinetic term i^*- 



J is recovered for a general function % = 
-3exp(-$i$*t/3) in Eq. (3.6), which, using Eq. (3.7) gives G = 4>i<j/ + log \W\'^. The last term 
in Eq. (3.8) provides the so-called tree level effective potential ¥{(/)) for the scalar fields. 



v{cp,4>) 



G^{G-^) -3 



(^K~'^ D^W DjW - 3\W\'^^ 



(3.9) 



Here, DiW = W,i + K^iW is a the "covariant derivative" of the superpotential W . If the kinetic 
terms are minimal and only renormalizable terms are present in the superpotential iy($), we 
recover the globally supersymmetric Lagrangian (3.3). We return to the potential (3.9) in detail 
in later Chapters within the context of string theory. 



No scale supergravity 



In the supergravity limit of the superstring theories that are studied in Part II of this thesis, one 
usually deals with "no scale" supergravity, therefore here we briefly state the definition of these 
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models. In no scale supergravity, the Planck scale is the only mass scale in the picture, i. e. there 
is no intermediate scale introduced (such as e.g. the gravitino mass) by effects supersymmetry 
breaking etc. In these models, the Kahler potential has the form (if there is but one scalar field 

G = -31og(0 + ^), _ = ^ = -3(* + r'. ^=3(0 + 0)-. (3.10) 

(To make contact with the notation in later Chapters, note that is then replaced ^ —i<i>-) 

As a consequence, one has from Eq. (3.9) that V{(j)) = for all values of (j). Hence, at tree 
level, the field (j) is massless, le. without potential, and its value can be changed at no energy 
cost. Fields with the property that they do not enter into the tree level effective potential 1^(0) 
obtained from supergravity are referred to as "flat directions" . 



3.2.3. Inflaton Potentials from Supersymmetry 



Lagrangians for scalar component fields 



Let us now put our understanding of the Lagrangian (3.8) of complex scalar component fields 
4>i of chiral supcrmultiplets to "inflationary" use. We saw that two functions K{(j)m,4>m) and 
W{(t)i) enter into the scalar Lagrangian C^f, = £kin — ^(0i)- Of these, K((prm 4>n) is the Kahler 
potential from which the scalars' kinetic terms are derived, 

-Ckin = X] -^"^^ dfj,(j)m d'^(t)n , Kmfi = ^ ■ ^7 • (^-H) 

m,n "^'^ 

The derivatives K^nfi therefore play the role of a "metric in field space" . The function W{(f)i) is 
obtained from the superpotential VF(<I>) by inserting the expansion (3.1) in terms of component 
fields. Let us specialize to the case of canonical kinetic terms for the scalar fields for now (as it is 
the case in global supersymmetry, before it is turned into supergravity). The scalar interaction 
potential is then obtained from derivatives of the superpotential only [compare Eq. (3.9)], 

i 

and is called the "F term potential" because after solving the supersymmetric equations of 
motion for the auxiliary field Fi in each chiral multiplet, Eq. (3.12) is equivalent to V((j)i) = 
[Recall our discussion of the superpotential F term projected out by integrating over 
J in Eq. (3.3).] From this it follows that supersymmetry can be spontaneously broken, i.e. the 
potential V{(pi) can have have a non-zero minimum, if one of the F^ acquires a non- vanishing 
vacuum expectation value {Fi) 7^ 0. 

For dcfinitcness, let us assume that the infiaton <f> is a supersymmetric field whose effective 
potential generically has the form (3.2), where the (Vo,m^,A) terms are present at tree-level. 
If the infiaton is one of the "flat directions" while supersymmetry is unbroken, the constant 
term Vb dominates and at first sight inflation may proceed with a very small mass arising from 
supersymmetry breaking effects. However, at loop order, an additional mass term is generated 
via the inflaton's coupling to the fields running around the loop. At fixed particle content of 
the chosen theory, however, these loop corrections are exactly calculable (and consequently, this 
should also be the case in string theory). An extensive review of possible models is provided 



dW 



(3.12) 
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in [31], and short discussions can be found in [24, 2, 106]. One may employ two strategies to 
render a quantum-corrected potential fiat enough for slow roll inflation: the superpotential can 
be dominated either by an F or by a D term [note that the latter is not the D term that gave 
rise to the kinetic terms of the scalar fields in Eq. (3.3), but a separate contribution to the 
superpotential which we discuss below]. These can be engineered to be flat, at least in a certain 
regime. We now consider one example each for two or three chiral multiplets, denoted $i,^>2 
and respectively, with their scalar components written as 01, 4>2 and ^q, <^_. 

F term inflation 

Consider the (renormalizable) superpotential for two chiral multiplets [107], 

WF{<t>i, = <t>i {Ml - M^) > (3.13) 
which, with the redeflnition = 4>i/V2 and Eq. (3.12), leads to the scalar potential 

Vf(0i, h) = 4>l \4>2\^ + |A0i -M^\\ (3.14) 

whose global minimum is at cpi = 0, (j)2 = M^/X. However, if 4>i is held flxed at a value exceeding 
the critical 4>i^^ = M'^/X, there is another minimum (for vanishing real and imaginary part of 
02) with V = [106]. Therefore, for 4>i > 4^^\ \ the held 4>i behaves like a flat direction 
with non-zero potential energy. Loop corrections generate a slope towards , and when 0i 
reaches its critical value, the field 02 rolls off towards the global minimum. Thus, the resulting 
inflationary model has the characteristics of hybrid inflation, where 02 is the field fixed during 
inflation, and (f)\ the waterfall point. 

However, in tiTic supcrgravity the "prescription" for obtaining the scalar potential V{(t>i) from 
the superpotential VF($) is more complicated than Eq. (3.12), namely it should be replaced by 
Eq. (3.9). In particular, the supcrgravity scalar potential has a prefactor V oc exp (kK), where 
K is the Kahler potential of Eq. (3.11). As a consequence, the potential slow roll parameter picks 
up a dependence rjy oc ifjj, where is the component of the field space metric in front of the 0j 
kinetic term. In the true vaccum of supcrgravity, this term should be canonical, i.e. Kij ~ 0(1), 
and during infiation it is not much smaller. This is referred to as the "r/ problem" of supcrgravity 
because the inflaton hence develops a mass (given by V") of {H'^) , which prohibits inflation. A 
possible way out is to use a Kahler potential of non-minimal form that makes r)v small enough 
[108]. 

D term inflation 

As mentioned earlier, one can write down a superspace expansion for a vector supermultiplet 
analogous to Eq. (3.1), making its spinor and vector components explicit. There is also another 
real pseudo-scalar auxiliary field D in this multiplet. If a theory contains both chiral and vector 
supermultiplets, the scalar potential F(0) has a so-called D term oc on top of the F term 

discussed earlier. This term is obtained from D oc 00 plus a possible Fayet Iliopoulos term, 
if a U{1) symmetry -like e.g. hypercharge Y~ is present. Unlike the F term potential, the D 
term does not receive an exponential prefactor involving the Kahler potential when supcrgravity 
corrections are introduced. It therefore does not suffer from the 77 problem and can be flat enough 
for slow roll inflation. 
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A superpotential from global supersymmetry including an additonal U{1) takes the form [109, 
110] 

Wd = A$o*+^- , (3.15) 

where the three chiral superfields have charges 0, ±1 under the {/(I) symmetry and A is a 
coupling constant. The scalar potential then has the form 

Vd{cPo,cP+,<I>-) = (|<^+</'-|' + + + ^ (|</'+|' - !</>- -?|')' , (3.16) 

where g is the gauge coupling and ^ is the parameter in the Fayet Iliopoulos term (here ^ > 0). 
The unique supcrsymmctric vacuum (where gauge symmetry is broken) is at <j)Q = (f)- = 0, ^+ = 
However, while 0o is in the regime |0o| > <7\/C/A, there is a minimum at (j)^ = (j)^ = 0, 
and the potential is flat at tree-level in the direction of 4>q (with V = g^S,'^ /2), while there 
is a "valley" in the (0+,(/)_) directions. At tree-level the F terms vanish in the inflationary 
field space direction. Loop corrections generate a small slope along leading to slow roll 
towards ^q'^'' = g\f^l2. The potential (3.16) therefore is a supcrsymmctric realization of hybrid 
inflation. A generalization of the superpotential (3.16) is obtained when the C/(l) symmetry is 
pseudo-anomalous as e.^. in string theory [106, 111]. 

We close this Chapter with two remarks. First, the fine-tuning constraints on the infiaton cou- 
plings (A, A„) can be lifted by relieving the infiaton from the duty of generating the primordial 
curvature perturbations. [Fine-tuning is necessary to ensure the predicted perturbation ampli- 
tude does not exceed the observed one of 0(lO~^), which is guaranteed by restricting inflation 
to the slow roll regime.] Instead, the perturbation spectrum may be generated by a different 
field, called the "curvaton", at the end of inflation [112, 113, 114]. 

In supersymmetry, many weakly (i.e. only gravitationally) coupled scalars arc available for this 
mechanism. If the curvaton decays late enough, its inhomogeneous perturbations can lead to 
fluctuations in the radiation density seeding the CMB temperature fluctuations. However, the 
decay has to be fast enough (within the first second) not to jeopardize SBBM evolution, which 
tightly constrains the possible mass range. Moreover, the resulting density perturbation is highly 
dependent on the evolution of a given mode k outside the Hubble scale. While inflationary model 
building is less constrained in the presence of a curvaton field, this mechanism also restricts the 
observational window on the microphysics of inflation. 

Second, there are (at least) two more cosmological puzzles one would like to see resolved in a 
GUT theory [9, 1]: it should contain feasible candidates for Dark Matter (of which there are 
many in supersymmetry), and it must be able to explain baryogenesis. A successful realization 
of reheating at the end of inflation together with baryogenesis can be used as a powerful criterion 
to discriminate between GUT candidates. 
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Extensions and Alternatives 



In this Chapter, we introduce two generalizations of the inflationary perturbation 
theory presented in Section 2.1: the possibility of multiple dynamic scalar fi,elds during 
inflation makes it necessary to consider the evolution of both adiabatic (curvature) 
and entropy (isocurvature) perturbations. We also discuss the scenario of k -inflation 
(Section 1.2.4) the perturbative level, which allows the calculation of fluctuation 
spectra for an inflaton field with non-canonical kinetic term. Finally, we briefly 
comment on alternatives to inflation. 



4.1. Multifield Inflation and Entropy Perturbations 



In Chapters 1 and 2 wc focused on scenarios with only one dynamic scalar during inflation, 
but attempts to identify the inflaton among the degrees of freedom of a (supersymmetric) Grand 
Unified Theory usually confront us with a multitude of fields 4>i- Several of them can contribute 
to the accelerated expansion if they are light compared to the Hubble scale. Each field undergoes 
quantum fluctuations 5(j)i, and therefore a qualitatively new form of perturbations arises: while 
a single field constitutes the energy density of the Universe, its perturbations are essentially 
equivalent to fluctuations in the energy density 5p, and hence of adiabatic (curvature^) type. 
However, in the multifield case where p depends on all ^j, different field perturbations (5^^ can 
combine in such a way that the overall density perturbation vanishes, 5p = 0, but the relative 
contribution of each to the total energy density p fluctuates. These perturbations are hence 
of entropy (or isocurvature) type. We briefly show how this new perturbation component can 
be calculated from the background evolution in multifield models. Details are readily available 
in the literature, see e.g. [115, 116, 117, 118]. 

For a collection of scalar fields ^j, i = 1, 2, . . . n, we have [instead of Eq. (1.25)] a set of i Klein 
Gordon equations, 

ij)i + ZH^i + ^V{^^,(j>2,...) = Q, (4.1) 

where the potential y is a function of all scalar fields at once, which can contain mass terms 
for certain fields, their self-interactions and notably cross-couplings between different Even 
in the absence of explicit interactions, however, the fields are coupled gravitationally via the 
multifield Hubble parameter, H"^ = (k/3) J^ii'P'i/'^) + ^ > where it has been assumed that all 
01 have canonical kinetic terms. Via the sum over all fields in the Hubble parameter, slow roll 
inflation may be possible even if the individual field potentials are steep. This mechanism is 
used, for example, in the "assisted inflation" scenario [119] as well as in "N-flation" [120]. 
With the field space now being n-dimensional, it is useful to assign a "weight factor" /j to each 



^Recall that d(f> and the metric variable 3> determine the comoving curvature perturbation 01 of Eq. (2.15). 
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<^X ^ Perturbation 




\ 



Background trajectory 



Figure 4.1.: In a two-field inflationary model 
with scalars {(f>, x) (both of which 
are light compared to the Hub- 
ble scale), the curved trajectory 
in field space can be decomposed 
into an adiabatic component a 
(along the trajectory) and an en- 
tropy component s (orthogonal to 
the trajectory). Both a and s 
have fluctuations, which give rise 
to the adiabatic and isocurvature 
perturbation modes, respectively. 
The angle 9 parametrizes the cur- 
vature of the background trajec- 
tory. (Figure from [115]) 



field, which is defined as [115] 



(4.2) 

,2 



measuring its contribution to the inflationary direction in field space. The direction a is obtained 
from integrating a = J dt J2i fi<t>i^ and it can be shown from Eqs. (4.1) that it evolves as 

dV 

i 

There are, however, (n — 1) more directions in field space which we may denote by Sj,j = 
1, . . . ,n — 1, and which can be chosen orthogonal to a (and relative to each other). In the 
following, we concentrate on the case n = 2 with the scalar fields (f) and % (and write si = s), 
for which the situation is illustrated in Fig. ??. 

We have hence exchanged the field space basis {(f), x) for tbe more convenient pair (a, s) along 
and orthogonal to the inflating direction. These new fields evidently can be decomposed again 
into a background and perturbations, a{t,x) = o"o(i) + S(T{t,x) and s{t,x) = 6s{t,x), where do 
obeys Eq. (4.3) and the entropy background field vanishes [115]. The perturbations 6s precisely 
give the isocurvature mode that the two-field scenario exhibits on top of the adiabatic mode^ 
da. 

In straightforward analogy with the single field case, the inherent gauge dependence of the 
adiabatic perturbation 6a requires a redefinition 5a^^^^ = 6a+{ao/ H) ip, while 6s is automatically 
gauge independent [121]. It can then be shown that the evolution of 6a^^^^ and 6s, respectively, 
may schematically be written as 

Sa^^^ + 3H 6a^^^ + " "'^) ^""^^^ = ^'^^ + ^^^^^ ' ^'^'^^ 

6s + 3H6s+('^-ml]6s = (4.5) 
V a2 y Kao a? 



^For the general case of n fields, the adiabatic perturbation is constructed from 5a = Yli fi^'Pi-: while for the 
entropy perturbation 5si = eij6<f)j, where Cij are coefficients such that Cjifi = 0.. 
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where ma,ms are the (time-dependent) adiabatic and entropic "mass" of the respective pertur- 
bations (calculated from weighted derivatives of the potential as well as other contributions) , and 
6 is a time-dependent factor (also consisting of background quantities) . The angle 6 describes 
the curvature of the inflationary trajectory in field space, sec Fig. ??. The detailed derivation of 
these equations is given elsewhere [115, 118]; here, we emphasize the following two observations: 
note that the only source term on the right hand side of Eq. (4.5) is oc ^' [where * is the Bardeen 
potential of Eq. (2.6)], and that it dies out on large scales when /c/a — > 0. On the other hand, 
Eq. (4.4) is sourced by terms oc 9 5s, hence if the inflationary trajectory is nontrivial in field 
space (^ 7^ 0), a non- vanishing entropy mode can feed the adiabatic curvature perturbation. As 
a consequence, the adiabatic mode no longer becomes constant on large scales (outside the Hub- 
ble radius), which is easily seen from the time derivative of the comoving curvature perturbation 
[compare the conservation law (2.78) for ( derived in the single field case]: 



Ji= ^ + — 96s (4.6) 



H k"^ ^ 2H 

-{ 

H (To 

Another way to see this is that for a single inflaton field the dominant Hubble friction term 
in its Klein Gordon equation during slow roll usually ensures that is quickly driven towards a 
unique attractor in phase space (^, which is independent of the field's initial conditions [122]. 
However, for multifield inflation the Hubble parameter depends on all fields simultaneously, 
and hence there are several possible trajectories in phase space depending on the set of initial 
field values. This ambiguity is responsible for the survival of non-adiabatic perturbations. 

Generalizing the slow roll regime to the two field case allows to obtain first order differential 

equations for Jo"^^'^ and 6s on large scales [123, 124], instead of the second order equations (4.4), 
(4.5) . The two modes of these first order equations can be described in terms of the curvature 
and isocurvature perturbations [115, 125] 

'R=^ 5(7(g') , S = ^ 5s , (4.7) 
Co ctq 

respectively. Since over many Hubble times, the slow roll approximation becomes less and less 
reliable, the evolution of 0? and S after Hubble crossing has to be taken into account by a transfer 
matrix. 



The first column in this matrix reflects the fact that i) adiabatic perturbations remain adiabatic 
if S = and ii) entropy perturbations are not sourced by adiabatic ones [see Eq. (4.5)]. 
In principle, instead of the sole comoving curvature spectrum obtained in single field inflation, 
three spectra have to be calculated in the multifield case: curvature and isocurvature perturba- 
tions as well as their correlation. They all contribute to the scalar spectrum and hence the 
observed temperature fluctuation ST/T in the CMB, see e.g. [126]. Notably, pure isocurvature 
perturbations would lead to CMB anisotropics six times bigger than their adiabatic counterpart. 
Therefore, observations exclude a pure isocurvature origin of the temperature fluctuations, but 
with their correlation taken into account, a mixed adiabatic and isocurvature origin of the ob- 
served ST/T can be feasible. 

Note that we have considered scalar perturbations only so far. Indeed, since they decouple from 
the matter sector at linear order, tensor perturbations are not affected by the presence of several 
scalar fields, and their spectrum remains unchanged. However, the ratio V^/V^ has changed 
and we now have 

V 

r=^c^ -8n^ sin^ A , (4.9) 
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where the additional factor measures for the cross-correlation due to Tj^ in Eq. (4.8), with 
cos A = T^js/ (l + r^§)^^^. Other observational consequences of isocurvature modes during 
inflation are, for example, non-Gaussianities, though they are of detectable size only if generated 
at the end of inflation [127, 128] or from inhomogeneous reheating [129, 124]. 



4.2. Perturbations in A;-lnflation 



In Section 1.2.4 we discussed the possibility of inflation from a generalized scalar field Lagrangian 
written as = p(X, 0), where X = {gni,d^(f)d^(l))/2. The function p{X,4>) plays the role of 
pressure, and the corresponding energy density was calculated in Eq. (1.32). Note that the GR 
sector of the theory is kept unchanged: we still use the Einstein equations (1.3), but their "right 
hand side", i.e. the energy momentum tensor, now reads [37, 38] 

T^, = ^ di,4>d,(l) -pg^, (4.10) 

with p and p given by their "fc-inflationary" expressions. It is then cumbersome but straight- 
forward to again push the Einstein equations to first order in perturbations with a generalized 
STfj^i, obtained from Eq. (4.10). Introducing the quantity^ 

o _ {dp/dX) _ (dp/dX) 

s {dp/dx) 2x {d'^p/dx'^) - (dp/dx) ' ^ ■ ^ 

one can show that the second order perturbed action fit for quantizing scalar perturbations in 
a universe filled with a fc-inflaton reads [38] 

S2S=IJ d'x l^v'^ - cl ViVi + ^ • (4.12) 

Here, we have assumed a flat universe /C = and the perturbations' "mass" term z"/z is now 
calculated with z = c~^/^ {acp' /%). Eq. (4.12) suggests interpretation of Cg as the "speed of 
sound" for the perturbations (where in standard perturbation theory Cg = 1), which is further 
confirmed by the equation of motion for the modes Vk after Fourier decomposition: 

v'l + cl k\k --Vk = (4.13) 
^ z 

It immediately follows that, provided the second term can still be neglected on large scales 
when —kr) — > 0, the solution of Eq. (4.13) obeys Ufc oc z in this limit. It is equally possible 
to calculate the perturbation spectra in an analogous fashion to the standard case of Section 
2.1, however, the "slow roll approximation" now includes the assumption of an adiabatically 
varying speed of sound Cg . Moreover, the freezing of modes now occurs when a given scale k 
satisfies c^k = aH (which can be understood as "sound horizon crossing" ) . The derivation of k- 
inflationary power spectra and their indices using the so-called uniform approximation is part of 
the results presented in Chapter 13, and will be discussed there, along with its consequences for 
recent CMB observations. At zeroth order, the amplitude of the scalar perturbation spectrum 
picks up an additional factor of 1/Cg , 

= ( T ) ' (4-14) 
\7rmi,eicJ ^^^^ 



^If Cg < 0, the theory is violently unstable already at the background level and therefore these models do not 
have physical significance [37, 40]. 
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where the quantities on the right hand side are evaluated at the sound horizon crossing of the 
pivot scale fc*. Since tensor perturbations are decoupled from matter at linear order, the tensor 
modes still obey Eq. (2.29). Therefore, their spectrum does not change with respect to the 
standard case, which has an important consequence for the single field consistency condition: it 
now reads r = —Sc^n^. Recall that in the last Section, we also concluded that multifield scenar- 
ios can be distinguished observationally from the standard single field case by their prediction 
for r. A measurement of tensor perturbations therefore promises to discriminate between the 
minimal inflationary perturbation theory of Chapter 2 and its extensions presented in this and 
the previous Section. 

We saw in Chapter 3 that supersymmetry inspired models of inflation with several scalar fields 
can have a non-trivial metric in field space K„in, depending on the form of their Kahler potential 
K{(j), (p). In the following, we shall find that multiple fields with non-canonical kinetic terms are 
also common in string theory. The derivation of inflationary spectra in these scenarios is quite 
involved and requires the combination of the extensions presented in Sections 4.1 and 4.2. For 
recent work in this direction, see [130, 131, 132] and references therein. 



4.3. Alternatives to Inflation 

Generic predictions of the inflationary scenario (see Section 2.3) pass all observational tests to 
date and have made it the major paradigm for the very early Universe. There are however, 
loose ends to the inflationary story: despite many attempts, it remains difficult to identify a 
good candidate for the inflaton field from viable particle physics theories. Moreover, in some 
scenarios the problem of initial conditions rears its ugly head: by design, inflation solves the 
initial conditions issues behind the SBBM's flatness and horizon problems, but e.g. within the 
class of small field models, fine tuning of the initial field value and velocity of the inflaton can 
be necessary to achieve enough expansion. 

Most importantly, however, inflation is not a solution to the "initial singularity problem": in 
an expanding universe described by GR and filled with matter sources that obey the weak 
energy condition, a singularity in the far past is inevitable [133, 134]. When approaching this 
singularity, our description of spacetime must be radically different. An inflationary period may 
emerge from this (like the SBBM emerges after the end of inflation), but other options can be 
considered. We conclude this Chapter by briefly citing some alternatives to inflation. 

String gas cosmology 

The string gas scenario for the early Universe [135, 136, 137] uses ideas from superstring theory 
(to be discussed in the following Chapters), which predicts a ten-dimensional spacetime. Six 
spatial dimensions therefore have to be compactificd to recover our four-dimensional Universe, 
and string gas cosmology resolves the question "Why three extended spatial dimensions?" by 
studying the annihilation of string winding modes around dimensions that are initially curled 
up with radius R. 

Due to the property of "T-duality" (which relates the compactification on radius R with the 
one of inverse radius 1/R and is discussed in Section 6.3), strings in thermal equilibrium have 
a maximum temperature Th, the Hagedorn temperature. Therefore, even when a gas of strings 
is further and further compressed while we follow the early Universe backwards in time, a 
temperature singularity is avoided. For large R, only the momentum string modes contribute to 
the energy density^, and they behave like radiation, hence the initial phase of SBBM evolution 

''There are three types of modes, momentum, oscillating and winding ones. Momentum modes correspond 
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is recovered. It must be checked whether the flatness and horizon problems are still solved 
for the resulting universe (which directly passes from the Hagedorn to the radiation phase 
without an intermediate stage of inflation). Other key issues to address in this framework 
are the stabilization of moduli fields (see below), and the generation of perturbation spectra 
[138, 139, 140]. 

EKPYROTIC / CYCLIC UNIVERSES 

In an ekpyrotic universe [141, 142], the horizon problem is addressed by a period of slow contrac- 
tion in the far past (before the Big Bang). The perturbation spectra are generated during this 
contraction leading to a "Big Crunch" , from which the Universe then emerges into the SBBM 
epoch. (In its extended version of this model, several subsequent contraction and expansion 
phases are considered, leading to the terminology of a cyclic universe [143].) These scenarios 
can be motivated from the collision of branes in string theory (see following Chapters). Again, 
it is crucial to check whether the resulting perturbation spectra can have the correct {i.e. nearly 
scale invariant) shape to match observations. 

Pre Big Bang and bouncing universe 

In the pre Big Bang scenario, one assumes that the Universe never experienced a singularity, but 
initially was in a state of finite maximum curvature [144]. Like the ekpyrotic model, this can be 
motivated from string theory, and it is found that the Universe underwent a contracting phase 
in the past, passing through a bounce before it emerged into the present period of expansion 
[145, 146, 147]. The minimum radius of the Universe at the bounce {i.e. the maximum curvature) 
then introduces another length scale into the problem. Perturbation spectra generated before 
the bounce and propagated through it generically experience a mixing of the pre-bounce k modes 
in the following expanding phase [148]. 



to degrees of freedom also available to point particles (the center of mass string motion), while oscillatory 
(corresponding to fluctuations of the string) and winding (wrapped around extra dimensions) modes are 
intrinsically stringy. 
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We discussed in Chapter 3 that inflation may he connected with unifying theories 
of high energy physics. A promising contender for a description of all interactions 
including gravity is string theory, according to which the fundamental constituents 
of Nature are oscillating strings instead of pointlike particles. In this Chapter, we 
introduce the basic concepts of string theory and establish its connection with a su- 
persymmetric effective field theory in ten dimensions. 

String theory originally emerged in the 1960s as a framework for the strong interaction prior to 
the development of quantum chromodynamics. It was then realized that the massless spin-2 state 
in the string theoretic particle spectrum can be interpreted as the graviton, and therefore string 
theory could be seen as a quantum theory of gravity. Today five supersymmetric formulations 
of string theory in ten-dimensional spacetime are known, closely related among themselves by 
so-called duality transformations. These relations suggest that they are not five distinct theories, 
but rather different branches on the tree of a single underlying framework. 

String theory explores the consequences of the bold assumption that, instead of elementary 
pointlike particles, the Universe is filled with tiny strings of characteristic length ^g- Just like 
vibrations of a violin string produce different sounds, the various particles we observe in Nature 
then are associated with distinct oscillations of the fundamental string. However, there is only 
one string type, and therefore all particles are described by the same theory [149]. The string 
length is is the only (dimensionful) input parameter in string theory. All other (dimensionless) 
parameters like coupling constants etc. are determined by the vacuum expectation values of 
scalar fields. This is in sharp contrast to the situation in the Standard Model with its 0(20) 
adjustable dimensionless parameters, and is commonly taken as a sign that string theory can 
provide a fundamental (not just an effective) description of physics. Many textbooks discuss 
string theory from the introductory to the expert level [149, 105, 106, 150, 151, 152, 153, 154]. 



5.1. The Bare Necessities 
5.1.1. The Bosonic String 

A relativistic particle following a (one-dimensional) trajectory T in d-dimensional spacetime has 
the degrees of freedom x™'(^) (where m = 0, 1, . . . , d— 1, and ^ is a convenient parameter along T) 
and is described by the action ^particle = -~fn /yds. (Recall that we set c = 1.) The parameter 
m is the mass of the particle, and ds is the line element obtained from the d-dimensional gener- 
alization of Eq. (1.1). Note that one may write ds as ds = d^ sj gmn (dx'^/d^) (da:;"-/d^), where 
we have used = ^, the remaining (where now i = 1, . . . , (i — 1) being spatial coordinates. 
Hence, for d = 4 with = (t, x) in Minkowski space (i.e. with the metric gfj,u = "^^u/), we clearly 
obtain the familiar point particle action of Special Relativity, ^Sparticie = —m — 'ip. 

A one-dimensional object like a fundamental string traces out a (two-dimensional) world sheet 
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Figure 5.1.: Left: A pointlike particle traces out a d = 1 trajectory (its "world line") in spacetime, while a 
one-dimensional object like a fundamental string has a d = 2 "world sheet" . (Consequently, 
a closed string has a "world tube".) This can be generalized to higher dimensional objects of 
string theory like p-branes (with p spatial dimensions) whose action is proportional to their 
(p + 1) dimensional "world volume" . Right: The ends of open strings can join to either form 
a longer open string, or (if two ends of the same string meet) a closed string. Therefore, a 
theory of only open strings cannot be consistent and closed strings must always be included. 
Closed strings can join to form larger string loops. 



S in spacetime (see Fig. 5.1), hence we need two parameters (^^,^^) to describe it. Suppose the 
string action is proportional to the area^ of E, 



dX"^ dX' 



Wng = -T I d^M^^ ^/ - det ( 9mn-^^-^ ] , (5.1) 



where T is a (dimensionful) parameter called the string tension, and the determinant under 
the square root runs over the coordinates on the world sheet, ie.it may be understood as the 
"induced world sheet metric" 7^6, 

QXm Qxn 

l/ab — 9mn-Q^-r^ ■ (5.2) 

This (2 X 2) metric is also referred to as the "puUback" of the spacetime metric gmn onto the 
world sheet. Until further notice, we implicitly set gmn = flmn: with rjmn the d-dimensional 
Minkowski metric. 

The coordinates X"* (^^,^^) are the (bosonic) degrees of freedom of string theory. For definite- 
ness, we fix the world sheet coordinate system as = t, = a, where cr is a spatial coordinate 
running along the string and r is the string's proper time. Without loss of generality, one may 
restrict a to the interval a = [0, 27r], while r runs between some initial and final instants of world 
sheet time, r = [Ti,T/]. Therefore, we may rewrite Eq. (5.1) as 

I^T^ f'2n 

^string = -r / dr / da^/^, (5.3) 

JTi Jo 

where 7 = det7af,. In the above form the string action is called the Nambu Goto action. The 
string tension T (with dimensions of mass per unit length, i. e. mass squared in our units) is but 



^As is conventional in string theory, we now use capital letters X"^ for the spacetime coordinates in Eq. (1.1) 
since they really are "mapping functions" X"^{^^,^^) [149], ie. they describe the spacetime embedding of the 
string. 
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a rewriting of the stringy length scale is, 

where a' = is called the Regge slope^. Quantization of the string action in the form (5.1) is 
difficult. At least at the classical level, there exists the alternative Polyakov formulation, where 
the (2 X 2) world sheet metric is used as an auxiliary field hab, 

^string = -\fJ dr j^J daV^ daXm dbXn ■ (5.5) 

Varying Eq. (5.5) with respect to and hab, we obtain the equations of motion and the world 
sheet energy momentum tensor Tab, respectively: 

da(y^h''''dbX"'^ = (5.6) 

Tab = -daX"'dbXm + \habh'"^dcX^ddXm = (5.7) 

Note that as an auxiliary field, hab has no kinetic term in Eq. (5.5), hence its equation of 
motion corresponds to the vanishing of the energy momentum tensor, Tab oc SSstrmg/ Sh"-^. By 
construction, the action (5.5) has Poincare invariance in d spacetime dimensions and is invariant 
under local reparametrizations of S, i.e. the world sheet experiences two-dimensional gravity. 
There is, however, an additional conformal invariance on S, corresponding to a coordinate- 
dependent rescaling of hab (also called Weyl rescaling). Together, world sheet reparametrization 
and conformal invariance allow us to fix a gauge such that hab = Vab = diag(l, —1), i.e. the world 
sheet is flat. This is called the "conformal gauge", in which the equations of motion Eq. (5.6) 
take a particularly simple form: 

= - = (5-8) 

Note that, since we have already fixed the world sheet metric, the vanishing of the energy mo- 
mentum tensor (5.7) now amounts to constraint equations imposed on the solutions of Eq. (5.8). 

Moreover, when we derived Eq. (5.6) from (55string/^-^"^ = 0, we quietly set surface terms 
to zero, which amounts to boundary conditions for the X"^. These can, separately for each 
m = 0, 1, ... d — 1, be of two distinct types: 

• For closed strings, X"*(r, a -|- 27r) = X"^{T,a), so-called Dirichlet boundary conditions. 

• For open strings, dX"^/da = at cr = 0, cr = 27r, so-called Neumann boundary conditions. 

Only Neumann boundary conditions allow to conserve Poincare invariance for open strings and 
ensure that no momentum is flowing through the ends of the string. Later, however, it was 
realized that open strings can also have Dirichlet boundary conditions if they are attached to 
dynamical objects called D-branes. We come back to this in Section 6.1. 

^Historically, the parameter a' described the relation between the angular momentum of a rigidly rotating open 
string (in units of h) and its mass squared, i.e.J/h oc a'm^. Hence, in plots of J/h vs. rr? used to study 
hadronic excitations (recall that string theory started out as a theory of the strong interaction), the parameter 
ct measured the slope. 
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Closed strings 



The classical solution of Eq. (5.8) in the closed string case X^^^^^{t, a) can always be written as 
a sum of "left-movers" [functions of (r + a) only] and "right-movers" [functions of (r — a) only] , 

^cTosed(^> ^) = Xl^nir + X,Tght(r - a) . (5.9) 

As is readily demonstrated in the literature, X^^^ and X^^^^ contain a constant and a linear term, 
plus an infinite tower of oscillatory modes with hermitian coefficients and a^, respectively^: 

X^dr + a) = ^ + ^(r + a)+zJ|5;^e-(^+^) (5.10) 

X^,uir-a) = ^ + «^(r-a)+zJ|j;^e— (5.11) 

With these solutions in hand, we proceed to imposing the constraints of Eq. (5.7) in the case of 
fiat world sheet metric. Given the separation of Eq. (5.9), it is useful to define components r++ = 

^ (Too + ^oi) and T = | (Too — Toi) of the energy momentum tensor, which depend only on 

X^^ and ^^ijt' respectively. The closed string solutions (5.9)-(5.11) then must guarantee 

r+_|_ = T = 0. (The mixed componentes and r__|_ vanish identically because the energy 

momentum tensor is traceless.) 

Classically, there is no obstacle to these constraints. However, to obtain a quantum theory, the 
coefficients (a™, a™) are promoted to the status of (hermitian) operators with the standard 
harmonic oscillator commutation relations (see Section 2.1.1). To impose Eq. (5.7), one then 

uses the Fourier components of T+-|- and T related to the oscillator creation and annihilation 

operators as (for now, the index k ^ 0) 

Lk = 2r rdae^*^(---)r__ = J a^_,a^„, (5.12) 

•^0 ^ n=-oo 

Lfe = 2T rdae^^(-+-)r++ = 2 £ <_n«mn. (5.13) 

•^0 n=-oo 

Here, the oscillator operators are hermitian with (a^)^ = a^n^ i^n')^ = ^^n- The Lj^jL^ 
constructed from them are called the Virasoro operators, and the rationale behind their definition 
is that r++ = T = holds true if the physical states | tp) of the theory obey^ 

Lkl^p) =Lk\ip) =0, k>0. (5.14) 

For k, k' 7^ 0, one may obtain the Virasoro algebra from Eqs. (5.12) and (5.13) and the (a^, a^) 
commutation relations to read 

[Lk, Lk>] = {k - k')Lk+k' , Lk,Lk' = {k - k')Lk+k' , Lk,Lk' =0. (5.15) 

For A; = 0, there is an ordering ambiguity in the definitions (5.12) and (5.13), and if one defines 
normal ordered operators as Lq"°^ = ^ Yl^=-oo ' (^^n^mn '■ (and likewise for Lq), this ordering 
ambiguity is illustrated by a (formally infinite) constant a appearing in the Hamiltonian, 



H 



= (4"°) + L^^"^ - 2a) . (5.16) 



^The index n here and in the following runs over the modes, while m still denotes the dimensions of spacetime, 
m = 0, 1, . . . d — 1. 

*For negative k, using Lk = il^, one still has {^p\Ln\i-p) = 0, meaning that states of zero norm occur but decouple 
from the positive norm physical states. 
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Likewise, the generalization of the algebra (5.15) to arbitrary k, k' is 

[Lfe, Lk'] = {k- k')Lk+k' + k{k'^ - l)5k+k'fi , (5.17) 

where we have simplified the notation setting L^^'^^ '■= Lq, and an analogous replacement for the 
second commutator in Eq. (5.15). 

Two steps allow to understand the role of a and d and determine their values. Firstly, one can 

show that the conformal gauge hab = Vab still left some residual gauge freedom, i. e. not all of 
degrees of freedom X"^, m = 0,1, ... d — 1, really are physical. In order to limit ourselves to 
propagating degrees of freedom only, we define two so-called light cone coordinates from the 
combinations 



= j=(x^±X'-^) , (5.18) 



while the remaining X^, i = 1, . . .d — 2, are kept as before and shall now be called the transverse 

degrees of freedom. [When we defined r++ and T before, these were the components of the 

energy momentum tensor in the light cone gauge of Eq. (5.18).] The residual gauge freedom 
permits to fix X~^ in an essentially trivial way (i.e. without oscillator components), X^ = Xq + 
a'p'^T. As a consequence, X~ may be expressed as a function of the transverse X^ only, and we 
hence only need to find solutions of the type (5.9)-(5.11) to the remaining {d — 2) equations of 
motion for the X^. 

The bosonic closed string state space can be built from the vacuum |0, k) at given momentum 
k, which is annihilated by both the (transverse) and the (where n > 0) and for which 

p+\n,k) = k+\n,k), p'\n,k) = k'\n,k), (5.19) 

with the "light cone momentum" p+ and transverse momentum p (whose components are the 
p^). We act on \n, k) with left- and right-mover creation operators and a!_„, which leads to 
physical states (in the light cone gauge) with masses 



/ 2 

^closed 



4 ( ^ : a*_„ai„ : -a j , i = 1, 2, . . . d - 2 . (5.20) 



\n=l 



In this expression a plays the role of the "vacuum energy" that arises when normal ordering is 
carried out, and one has 



d-2 A d-2 ^, _ d-2 



2 2 ' 24 ' 

n=l 

where the analytic continuation of the C, function, C,{s) = Yl^=i to s = —1 has been used, 
with C(~l) = —1/12. This procedure is called function rcgularization" , and it has allowed 
us to express a in terms of the spacetime dimension d in Eq. (5.21). 

Secondly, it turns out that d in bosonic string theory is not a free parameter, but must be fixed 

to avoid quantum anomalies in the commutators. These anomalies would destroy spacetime 
Lorentz invariance after quantization of the theory, and it can be shown that they are avoided 
if c? = 26, which consequently gives a = 1. 

Wc now turn to the mass spectrum predicted by Eq. (5.20). For the lowest-lying level of the 
closed bosonic string it follows that m^i^^^^^ < 0. We come back to the role of this closed string 
tachyon (and its open string analogue) below. For now let us note that as a consequence of 
shift invariance along the a direction on the "world tube" (see Fig. 5.1) left- and right-movers 
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give equal contributions to the mass, rwfgfj = ''Tirighf ^^'^ hence states with mdosed = are only 
obtained if both mieft and mright are zero. (These massless states are of special interest since 
they appear in the effective supergravity description of string theory, see below.) A general 
massless state may be written as 

^ Rij a^Xc^_\\n, k) . (5.22) 

ij 

The matrix [R]ij may be decomposed into a symmetric and traceless part [S]ij, an anti-symmetric 
part [A]ij and a trace t multiplied by the identity matrix I. The states described by Eq. (5.22) 
accordingly belong to three different categories: [S]ij gives rise to the spin-2 field hij, which is 
interpreted as the d-dimensional graviton gmn- Note that this state has emerged without putting 
in any spacctimc gravity at the classical level. [A]ij describes the Kalb Ramond anti- symmetric 
tensor field B jjifi in d dimensions. This two- index field Bfjin 

may be thought of as the stringy 
generalization of the four-vector potential in electrodynamics, which couples to the world 
line element dx^. (Indeed, we shall see that Bmn couples to the two-dimensional world sheet 
element dr da in an analogous way.) The trace t is a scalar degree of freedom called the dilaton 
We come back to these two companion states {Bmn, ^) of the graviton below. 

In distinguishing left- and right-movers for the closed string, we have assumed that its (tube- 
like) world sheet is oriented. However, closed strings can be made unoriented using the world 
sheet parity operation O,, which takes : a — > 27r — a, r — > r (i.e. it exchanges the left- and 
right-moving modes). In a theory of unoriented closed strings, only states invariant under Q are 
kept. In the case of the massless degrees of freedom, the graviton as well as the dilaton survive 
in the unoriented theory, while the Kalb Ramond tensor field is projected out. 



Open strings 

For open strings, due to the Neumann boundary conditions, left- and right-moving parts in 
X™g^(r, 0") are not independent, hence there is no separation like in Eq. (5.9), and only one set 
of oscillator coefficients is needed. The mode expansion replacing Eqs. (5.10) and (5.11) in 
this case reads 

^open(^> ^) = Xo+ '^C('p"'t + iV2^ ^ e'"'^ cos{na) . (5.23) 

A similar quantization procedure involving the definition of (one set of) Virasoro operators goes 
through, and notably anomalies are again avoided in d = 26 dimensions. Using the light cone 
gauge, one may again obtain a mass formula 

oo 

«' "iopen = Yl '■ «-n«in : -1 • (5.24) 
n=l 

Once more we postpone the discussion of the tachyonic ground state for the moment. In this 
case, the first excited state a^_i\i},k) gives a massless vector field with 24 degrees of freedom. 
(We do not pay attention to the higher-lying massive states since they do not enter in the 
effective field theory description we shall use eventually.) It is intuitively clear that open strings 
also have an orientation since their endpoints are distinct. A world sheet parity transformation 
therefore identifies the endpoints, making the open string unoriented. 

To summarize, the bosonic string action (5.5) describes the coupling of d = 26 scalar fields 
X"^(r, (j) to two-dimensional gravity on the world sheet. The resulting particle spectrum after 
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quantization contains only spacetime bosons, and among the low-lying (closed string) states, we 
encountered the graviton along with the Kalb Ramond field and the dilaton. However, both 
the spectra of the closed and of the open string contain a state of negative mass. While the 
open string tachyon has been understood in terms of the decay of D-branes (see Section 6.1), the 
closed string tachyon remains pathologic. Moreover, we cannot be satisfied with a theory of only 
bosons, given that matter in the real world is made of fermions. This leads to the introduction 
of superstrings, which we now discuss. 

5.1.2. The Superstring 

In Chapter 3, we gave arguments in favour of an underlying "supersymmetry" between bosons 
and fermions in the realm of GUT energies, though this symmetry must be broken in the 
Universe at present. Since string theory is aimed at describing physics beyond the Standard 
Model, supersymmetry may be part of it. This suggests the introduction of word sheet spinors 
'^"^{t, a) as superpartners for the X"^{t, a) coordinates in a supermultiplet^. Both fermions and 
bosons should then be coupled to two-dimensional (super-) gravity on the world sheet. Note that 
the theory obtained in such a way has world sheet supersymmetry by construction. Spacetime 
supersymmetry is only obtained after a suitable projection (see below), and is never explicit in 
the formalism (called Ramond Neveu Schwarz) presented here. Spacetime supersymmetry can, 
however, be made manifest using the alternative Green Schwarz formalism. 

A reasonable starting point is the action 

'^Serstring = j d^'d^' ^ (h^'daX^^ d,Xm + i^^p-'da^m) , (5.26) 

where the matrices {p^,p^) are the 7-matrices in two dimensions with {p"',p'^} = 2r/"^I. Indeed, 
this action has the familiar world sheet reparametrization invariance from before^, plus an addi- 
tional (on-shell) global world sheet supersymmetry relating and X"*. One can show that the 
commutator of two successive such transformations acts as a world sheet translation on ^'"^ and 
X™", respectively. However, when supersymmetry is made local by taking the infinitesimal two- 
component Majorana spinor parameter (" as world sheet dependent, — t- ("(r, a), a supercurrent 
appears when varying the action ^g^pg^.g^j.^j^g of Eq. (5.26). It may be cancelled by introducing 
a two-dimensional "gravitino" Xa (the superpartner of the graviton, with appropriate transfor- 
mation properties), 

'^Serstring = " ^ / d^'d^' ^ Xap'p^^"" d,X^ . (5.28) 

However, to make the total superstring action invariant under local supersymmetry transforma- 

^In terms of an expansion in component fields analogous to Eq. (3.1), this superfield reads 

y'"(o-",6i) = x'"((t") + ^*'"(o-") -I- ^ eeB"'{a") , (5.25) 

where a" = (r, a) and the {9, 9) are the supcrspace coordinates on the world sheet. The field is again an 
auxiliary field that allows to close the world sheet supersymmetry algebra off-shell. 
®In terms of the superstring superfield of Eq. (5.25), this action may be written as 

'^i^^e.tHng = ^ / d\d'e D^Y- D^Yr^, = + ^^'^'^^^ ^" " ^^'^^^ 

The auxiliary field B'^ is found to obey the equation of motion B^ = and can be eliminated. 
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tions at first order, still one more term is required: 

'^Serstring = / d^M^^ ^ ^^^r^XapV^Xb (5-29) 

Then, the sum of Eqs. (5.26), (5.28) and (5.29) is invariant with respect to local world sheet 
supcrsymmetry transformations at first order, i.e. we have introduced two-dimensional super- 
gravity on the world sheet. 

The conformal invariance that, together with world sheet reparametrizations, allowed us to 
choose a flat world sheet metric survives the generalization from bosonic to superstrings and 
we may still take hab = Vab- What is more, the two-dimensional world sheet supersymmetry 
together with an additonal "superconformal" invariance of the superstring action may be used to 
set the gravitino to zero, Xa = 0. This is the analogue of the conformal gauge for the superstring, 
in which the equations of motion for ^'"^ and X'^ simply read 

dad^'X'^ = , ip^da^"^ = . (5.30) 

Again, the remaining equations of motion (for hab ^'iid Xa) turn into the constraint equations 
Tab = 0, J" = once the supercovariant gauge choice is made, where the energy momentum 
tensor Tab ^^nd the supercurrent J" are given by 

Tab = -daX'^dbXm-\^'^{padb + Pbda)'^m 

+ M (^99 dgXm + ^"^f^dg^rr^ , (5.31) 

r = ^p'p'^-^^dbX^. (5.32) 

Boundary conditions for the solutions of Eqs. (5.30) are again obtained from the requirement of 
vanishing surface terms when varying the superstring action. For the bosonic degrees of freedom 
X™, these are unchanged from before. For the newly introduced spinors, note that one may 
write 

= ("^i^*) , ^™ ht = ^nght(^ - Kft = Kft(T + a) , (5.33) 

where for now the separation into left- and right movers is a tool: using the ansatz on the left 
in Eq. (5.33) in the equation of motion (5.30), we see that Eq. (5.30) reduces to 

(^ + ^)*'4m = 0, (^-^)<,.=0. (5.34) 

This justifies the expressions on the right in Eq. (5.33). There are then also the same two 
boundary choices, i.e. open or closed fermionic strings. 

For closed strings, one may impose periodic (referred to as Ramond) and anti-periodic (referred 
to as Neveu Schwarz) boundary conditions on *J^f^ and ^^fg^t separately, i.e. choose one sign 
each in 

*I^ft(r,a) = ±^]^^{T,a + 27r), (5.35) 
*right(^,'^) = ±*Hght(^><^ + 27r). (5.36) 

Hence, there are four possible combinations of boundary conditions for closed fermionic strings, 
and the state space can be divided accordingly into the Ramond-Ramond (R-R), Ramond- Neveu 
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Schwarz (R-NS), Neveu Schwarz-Ramond (NS-R) and the Neveu Schwarz-Neveu Schwarz (NS- 
NS) sectors. It can be shown that in spacetime, the R-R and NS-NS sectors give rise to bosons 
eventually, while the mixed sectors contain spacetime fermions. 
For open strings, the boundary conditions are 

*i?ft(^>^ = 0) = *^ght(^,'^ = 0) (5.37) 
*right(^,'^ = 27r) = ±*^ght(^,^ = 27r). (5.38) 

The overall relative sign in the first line, i.e. for a = 0, is a matter of convention, but the 
relative sign at the other end (the second line, a = 27r) is meaningful. If the "+" sign is 
chosen (periodic, ie. Ramond boundary conditions), it can be shown that the resulting states 
are spacetime fermions. If one opts for relative "— " sign at the other end (that is, anti-periodic 
or Neveu Schwarz boundary conditions), one recovers spacetime bosons. 

The equations (5.30) may then again be solved using an oscillator expansion and imposing 
the above boundary conditions. As an example, let us cite the mode expansion for the right- 
moving part of the closed superstring with Ramond and Neveu Schwarz boundary conditions, 
respectively: 

Kht]R=E^™«"'''"'^ Kht]NS= E ^re-^^^"-'^) (5.39) 
feez rez+i/2 

Note that k in the left expression runs over all integers, while the index r on the right runs over all 
half-integers. For the left-movers, corresponding expansion coefficients (7™, ^™) are introduced. 
Once promoted to operators, these coefficients now satisfy ani«-commutation conditions 

hk, Yi} = W, in = Sk+i,o r?™" , (5.40) 
m = {^r\ 4"} = Sr+s,o ry'"" , (5.41) 

for the Ramond and the Neveu Schwarz cases, respectively. The (half-)integer character of the 
indices (r, s) and {k, I), respectively, suggests that, as mentioned above, states from the R-R and 
NS-NS sectors are spacetime bosons, while fermions arise from R-NS and NS-R. For the open 
string, left- and right movers are not independent (as it was the case for the bosonic string), 
and therefore one set each of (Ramond) and (Neveu Schwarz) suffices. 
In our analysis of both the bosonic and the fermionic string, we encountered the property that 
closed strings -due to their subdivision into left-movers and right-movers- required two copies of 
the oscillators and (/3^,7™), respectively, as opposed to one copy for open strings. In fact, 
one may think of the state space of closed strings as a direct product of left- and right-movers, 
each of which has the same structure as the open string spectrum. We therefore restrict our 
considerations to open superstrings with operators a™ for the bosonic X"^ and (/3™, 7^.") for the 
fermionic (with its two sectors R and NS) in the rest of this Section. 

To impose the resulting constraints of Eqs. (5.31)-(5.32), it is again useful to introduce Virasoro 

operators as the Fourier components of T+_|_, T and J+, J_, respectively^. The superconformal 

invariance on the world sheet then demands that these components as well as the supercurrents 

vanish, T+-|_ = T = J+ = J_ = 0. The details are readily available in the literature [105, 151]. 

It is again possible to prohibit ghosts {i.e. states of negative norm) by postulating conditions for 
physical states using the super- Virasoro operators. The fermionic analogues (or and onS; one for 
each sector) of the bosonic ordering constant a may be expressed as functions of the spacetime 
dimension d. From the consideration of quantum anomalies, it follows in this case that Lorentz 

'^The latter are defined as J+ = ^i^a d+Xm.\e!t and J- = ^'nght 9- X^, right, where d± = dr ±da. 
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invariance is preserved if d = 10 (and or = 0, ons = 1/2)- Superstring theories therefore live in 
ten dimensions. Depending on the boundary conditions (i.e. in its different sectors introduced 
above), they can describe spacetime fermions or bosons. The construction of the spectrum from 
the NS sector vacuum state \Q, A;)ns (which is a scalar and annihilated by both and /?* for 
n, r > 0) and k)^ in the R sector (annihilated by aj^ and 7* for n,r > and of spinor type) 
is discussed in detail e.g. in [151]. Here, we state the resulting mass formulas in the light cone 
gauge, i.e. using the transverse oscillators i only (normal ordering is implied): 

00 00 J 

n=l r=l/2 
oo oo 

a'm| = ^a'_„ai„ + ^n7l„7i„ (5.43) 

n=l n=l 

In the NS sector, the lowest-lying state again is a tachyon, as we see from Eq. (5.42). Note that 
the index r is half-integer valued and therefore the first excited (massless) state is obtained by 
acting on /c)ns with /31^y2- ^ spacetime vector because it arises from a vector operator 
acting on a scalar ground state. (The bosonic oscillators only contribute to the higher 
excitations we do not consider.) The lowest state in the R sector is a massless spinor. 



The GSO projection 

We saw that a Lorentz invariant, ghost free superstring theory of open (and closed, see below) 
strings can be formulated in ten dimensions. The resulting particle spectrum contains both 
spacetime bosons and fermions. There are, however, two points we glossed over so far: firstly, 
supersymmetry on the world sheet does not imply spacetime supersymmetry. Secondly, as is 
readily seen from Eqs. (5.20), (5.24) and (5.42), massless states are not the lowest ones in the 
spectrum: both bosonic and superstring theory contain tachyonic states of negative mass, which 
means that their vacuum state is unstable. The persistent closed string tachyon lead us to 
abandon bosonic string theory, but luckily the situation is different for the superstring: it turns 
out that the superstring NS tachyon may be removed, and moreover spacetime supersymmetry 
in ten dimensions achieved, by applying a projection due to Gliozzi, Scherk and Olive (GSO). 
The GSO projection operators for the NS and the R sectors are defined from 

^^s = Ml + (-1)^-^+^] , Pa = i [l + (-1)^^+1] , (5.44) 



2 

= 1/2 



where Pns = l^^-^rPir counts the (3 oscillator excitations (i.e. it is the world sheet fermion 



number). Likewise, Fr = '}2^=il-nlin iii the Ramond sector. The additional parameter r/ in 
the Ramond operator Pr can be chosen ±1, depending on the chirality of the spinor ground 
state |f2,/c)R. In the NS sector, the GSO projection therefore removes all states with an even 
number of world sheet fermions, and notably the NS tachyonic ground state is projected out. 
In the R sector it is a matter of convention whether an even or odd number of 7 oscillations 
are kept, and the two choices give rise to different superstring theories, as we discuss below. It 
can be shown that after the GSO projection, the number of fermionic and bosonic degrees of 
freedom is equal at each excitation level. This is necessary for, but not a proof of spacetime 
supersymmetry. One may, however, show explicitly that the theory is supersymmetric in ten 
spacetime dimensions after the GSO projection if the alternative Green Schwarz formalism is 
employed (see e.g. [151]). 
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5.1.3. Five Theories 



The last part of the above discussion was carried out for open superstrings only. However, 
the endpoints of open strings can meet and form a closed string (see Fig. 5.1), which is why 

a consistent theory cannot be built from open strings (and their R and NS sectors) alone. 
Combining two copies of the open string state space (one for the left- and one for the right- 
movers), we can construct the closed string spectrum with its R-R, R-NS, NS-R and NS-NS 
sectors. Note that in Eq. (5.44) we have the choice of either projecting onto states with odd 
or even (depending on the chirality of the ground state, which we write |+)r or |— )r, 
respectively). This choice can be made independently for left- and right- movers, and their R 
sectors then have the same or opposite chirality. The former choice gives rise to the so-called 
type IIB superstring theory^, while the latter defines type IIA superstring theory. 
This distinction along with two other criteria whether a theory contains open strings on top of 
closed ones, and whether its fundamental strings are oriented^ or not- allows the identification 
of five distinct superstring theories referred to as type I, the above-mentioned types IIA and 
IIB, and the heterotic SO{32) and Eg x Eg theories. 

Theories of oriented closed superstrings: type IIA and type IIB 
Their spectra are obtained by taking the direct product of one open string spectrum each for the 
left and the right movers of the closed string, giving rise to the four sectors R-R, R-NS, NS-R 
and NS-NS. As before, left-mover operators carry a tilde, and since the effective field theory we 
are ultimately interested in only contains the massless states, and we restrict ourselves to these 
here. The state space then is summarized as [151]: 

TYPE IIA TYPE IIB 

R-R |-)r«)|+)r |+)r«)|+)r 

R-NS \-)R^f3U/2\^hs \+)R^f3U/2\^Us 

NS-R PU/2\^Us^\+h PU/2\^Us^\+)r 

NS-NS PU/2\^Us ® Pii/2\^)m Pli/2\^hs ® PIi/2\^)nS 



Note that there are two chiralities in the type IIA case, but only one in type IIB. We now list 
the fields in the different sectors. 

• NS-NS sector 

It is easily seen from the above table that this sector is shared by both theories. Its 
massless states are bosonic and give the symmetric graviton g^n, the anti-symmetric Kalb 
Ramond tensor gauge field Bmn and the dilaton 

• NS-R and R-NS sectors 

Note that the first of these is again the same for both theories, but in the R-NS sectors the 
chiralities are different. The (fermionic) fields in these sectors are a spin-3/2 "gravitino" 
and a spin- 1/2 "dilatino" each. The presence of two gravitinos means that type IIA and 

*The chirality in this case is chosen positive for definiteness. 

^Recall that in Section 5.1.1 we the notion of a world sheet parity transformation that turns oriented into 
unoriented strings. 
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type IIB superstring theory have M = 2 supersymmetry in ten dimensions. In type IIA 
these gravitinos have opposite chirality. 

• R-R sector 

This sector again contains bosons. In the type IIA case, one obtains a U{1) gauge field 
Ajji and a three-form Aimn- For type IIB, there is a zero-form {i.e. a scalar) Co, a two- 
form Cmn and a four- form field Ckimn- Note that each of these fields (with n indices) is 
associated with a field strength F^^^Xi) ~ ^^(n) — 1)3), or FJ^^-^^ = dC(„) (n = 0,2,4), 
respectively, and they are summarily referred to as the "R-R gauge potentials" . We shall 
see later that the number of indices of the fields present determines the dimensionality 
{p + 1) of each theory's stable Dp-branes. 

Given their state spectrum, both type IIA and IIB have N = 2 supergravity in ten dimensions, 
but type IIB is non-chiral on the world sheet. (Another way to see this is that the world sheet 
parity transformation O is a symmetry of type IIB because exchanges left- and right-movers, 
and in type IIB they have the same chirality.) Our focus in later Chapters is on cosmological 
models living in type II theories, therefore our account of the remaining three superstring theories 
below is shorter. 

Theory of open and closed unoriented superstrings: type I 

In this theory, unoriented open strings are combined with an "unoriented" version of type IIB 
theory of closed strings, ie. the world sheet parity symmetry of type IIB is gauged away. The 
NS-R and RNS sectors transform into each other under the Vi operation, and only one copy 
remains in the unoriented theory. It can be shown that type I has the field content of a chiral 
M = 1 supergravity multiplet: the dilaton the graviton g^nn with its gravitino, and an anti- 
symmetric tensor field Cmn [as well as the gauge boson and the gaugino of a A/" = 1 super Yang 
Mills theory of 5*0(32)]. The Kalb Ramond field B^n is absent from the spectrum because it 
does not survive the "unorientation" procedure. 

Theories of heterotic closed oriented strings: heterotic 50(32) and x 
In these theories, the decoupling of left- and right-movers is used to make only the latter ones 
supersymmetric, while the left-moving degrees of freedom are described by bosonic string theory. 
However, the bosonic string is consistent with Lorentz invariance in 26 dimensions, while d = 10 
for the superstring. Hence, 16 of the left-movers must be compactified, and this can only be 
done on a special type of internal lattice. The two heterotic theories differ by the choice of 
lattice, which can either be E^ x £'§ or 50(32). When the supersymmetric right-movers and 
the bosonic, compactified left-movers are combined, one obtains A/" = 1 supergravity in ten 
spacetime dimensions with the dilaton the graviton gmn plus gravitino, the anti-symmetric 
Bmn [again together with Af = 1 super Yang Mills theory of either Eg x Eg or 50(32)]. 

These five different formulations of superstring theory are related among each other by a close 
web of so-called dualities, which is sketched in Fig. 5.2 and explored further in Section 6.3. As 
a consequence, it is strongly believed that they are but distinct realizations of one underlying 
theory, currently dubbed "M-theory". In contrast to superstring theories, M-theory lives in 
eleven dimensions, and its fundamental objects are not strings, but membranes. M-theory is 
still very much under development, but it is known that in its low energy limit, it most likely 
reduces to = 1 supergravity for d = 11. We do not venture further into the realm of M-theory 
here, but we shall use its eleven-dimensional supergravity description shortly when establishing 
the effective d = 10 supersymmetric action of type IIA. First, however, we pause for a brief 
discussion of the dilaton field 
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Figure 5.2.: Left: The five difi^erent superstring theories (type IIA and IIB, type I, heterotic SO{32) 
and heterotic Eg x £'§) are related by a web of dualities: these are T- and S'-duality (ex- 
changing compactification radii i? — > and the dilaton $ — > — $, see Section 6.3), and 
the "unorientation" procedure il. By compactification on a circle S^, type IIA superstring 
theory in ten dimensions can be obtained from eleven-dimensional supergravity, which is 
strongly believed to be the low-energy limit of M-theory. Top right: Closed string Feynman 
diagram at tree order: because of the extended nature of the strings, interactions no longer 
take place at a fixed point in spacetime, and the theory is therefore UV finite. The lines are 
"cuts" of the tree level diagram on different time hypersurfaces ti < T2 < T3. Bottom right: 
Closed string Feynman diagram at loop order: for closed oriented surfaces, the number of 
handles counts the number of loops. 



5.2. The Special Role of the Dilaton 



Taking closer look at the dilaton one can appreciate a remarkable property of string theory as 
opposed to the quantum field theory of point particles: in the latter, many Feynman diagrams 
have to be calculated at each loop order for a given interaction process, and a cutoff Auv has 
to be imposed on the momenta running around the loop to make the theory finite in the UV. 
In string theory, on the other hand, there is a single diagram at any loop order nioop, and there 
are no UV divergencies. An illustrative explanation of this is given in Fig. 5.2. 

Let us Euclideanize the (so far) Lorentzian world sheet metric hat by taking r — )• —ir, and define 
complex coordinates {z, z) in local patches by 

z = e^-''', z = 6^+'" . (5.45) 

The world sheet can now be regarded as a Riemann surface, i.e. a one-dimensional complex 
manifold. The Polyakov action (5.5) we wrote down in Section 5.1.1 is at lowest order in and 
expansion in terms of a' (visible in the prefactor, ie. the string tensions T oc 1/a')- At the 
next order 0(a"^), Eq. (5.5) is supplemented by a term S oc J (P^ \/—h R^'^\ where -R*-^) is 
scalar curvature on the two-dimensional world sheet with metric hab- This term serves as the 
"string coupling constant" in the following way: using the z coordinates defined in Eq. (5.45), 
the contribution of the dilaton <I> in the spectrum of the closed string to the world sheet action 
up to order is [151] 

Sq> = ^ [ d^zV^^R^^'^. (5.46) 
47r 7s 
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If the dilaton is constant, $ = $o, the integrand in Eq. (5.46) is a total derivative, and therefore 
»S$ is determined by the global topology of the world sheet. More precisely, it corresponds to 
the Euler characteristic of the surface S, 

X(S) = / d^z\/hR^'^^ = 2 - 2nh - nb - nc . (5.47) 

Here, rih (the genus of E) counts the number of "handles" , nb the number of boundaries, and ric 

the number of cross-caps of the Riemann surface in question. In a theory of only closed strings 
(as it is the case for type IIA and type IIB), the world sheet has no boundaries (which would 
be created by the ends of open strings). The type IIA/B fundamental strings are also oriented, 
therefore S is necessarily orientable and has no cross-caps. With nb = rzc = 0, and the only 
possible world sheet topologies are closed and oriented Riemann surfaces, which are uniquely 
characterized by nh, and this is also the number of loops nioop in the corresponding Feynman 
diagram (see Fig. 5.2). The Euler characteristic then is x = 2 — 2nh, and from Eq. (5.46) we 
find 5$ = $o(2- 2nh). 

When string theoretic Feynman diagrams are calculated using the path integral formalism, 

Z oc y [Vhab] J [DX""] ■ ■ ■ e-('5='ri"s+'5*) , (5.48) 

the first integral runs over all possible world sheet topologies, with the world sheet diffeomor- 
phisms and Weyl scalings that leave the string action iSgtring invariant "taken out" . Therefore, Z 
is found from a perturbative expansion in the number of loops (or handles) of the world sheet, 
Z = Ylnl=o -^"h- -^^^ ^^^^ level diagram with = 0, the dilaton contribution to the action 
5$ then reads e"^*" = g~'^, and each handle gives an additional factor of g"^. In this sense, the 
"string coupling" gg is the expansion parameter in the number of string loops, and one must 
have < 1 for a perturbative regime. Note that we considered the dilaton constant, but in 
principle it is a dynamical scalar field, which must be stabilized at a value $o to allow for the 
above reasoning. This is a first example of a dimcnsionless parameter in string theory (here, the 
coupling gs) determined by the vacuum expectation value of a scalar field, as we mentioned at 
the very beginning of this Chapter. 



5.3. Effective Supergravity Field Theories 



Apart from the number of string loops, parametrized by there is a second expansion param- 
eter: the (dimensionful) Regge slope a' = is a measure of "stringiness" since it describes the 
diff^erencc between a one-dimensional string and a point particle, i.e. the limit a' — t- 0. Note that 
this is also the low energy limit because (in a time-independent background) a'E'^ is the only 
dimcnsionless combination with a' . In Section 5.1.3, we restricted ourselves to massless fields in 
the spectrum because when turning off the "stringiness" by taking a' — > 0, all but the massless 
particles become infinitely heavy, see Eqs. (5.42) and (5.43). The field content of the different 
superstring theories listed in Section 5.1.3 suggested that they have N = 2 (or J\[ = 1) local 
supersymmetry in ten dimensions, and hence we may expect a d = 10 supergravity form for 
their low energy effective actions. We only consider the bosonic part of supergravity actions in 
the following since we are looking for classical solutions without fermionic degrees of freedom. 
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5.3.1. Type MA Supergravity 



The supergravity description of type IIA theory is the most straightforward to obtain: we stated 
earher that the low energy hmit of M-theory (see Fig. 5.2) is eleven-dimensional supergravity, 
and it is known that type IIA theory follows from it by simple "dimensional reduction" (sec 
below) on a circle of radius R. At the effective supergravity level, the action in eleven di- 
mensions is constrained from the requirements of level matching between fermionic and bosonic 
degrees of freedom (to preserve supersymmetry) , general coordinate invariance and local Lorentz 
invariance, and the bosonic part of this unique action reads 

= i / d^'^^ (^^''^ - I 1^(4)1') - I J ^(3) A F(4) A F(4) . (5.49) 

The eleven-dimensional gravitational coupling ku is related to the d = 11 Planck length £^^^^ 

by 2Kn = [2tt£^^^^^ /27r, and R^^^^ is the Ricci scalar in eleven dimensions (obtained from 
the metric"^'' Gm'n' with determinant G). The field strength F(4) = d^(3-) is obtained from the 
three-form gauge potential introduced to match the number of fermionic degrees of freedom 
(whose action we do not display). Dimensional reduction is a procedure where one dimension 
is "curled up" on a circle, and only the zero modes of the corresponding Fourier expansion in 
that dimension are kept for each of the fields in the original higher-dimensional theory. (For 
compactification, the entire tower of modes survives in the lower dimensionality, see Section 6.2.) 
In the case at hand, reducing Eq. (5.49) down to ten dimensions produces the d = 10 graviton 
9mn, a gauge field and the dilaton $ from the eleven-dimensional metric Gm'n'- Studying the 
eleven-dimensional line clement dsf^ in this decomposition G,n'n' ^ {gmm A^,^), one obtains 
the relation ip^^ = gV'^ts between the eleven-dimensional Planck and the fundamental string 
scale [151]. The three-form ^4^3) in d = 11 leads to a three-form and a two form, Ajnnr and -B^„, 
at the ten-dimensional level. One also finds that the four-form field strength has to be replaced 
by the gauge-invariant combination 

F(4) = dA(3) + A F(3) , (5.50) 

where i/(3) = di?(2)- (This is necessary to preserve invariance under supersymmetry variations.) 
The integration over the compact eleventh coordinate is performed with the radius of the circle^^ 
set to i? = gsls- For the gravitational coupling constant in ten dimensions, this gives 2kiq = 
(27r4)^5's/(2vr) = 2kiQg^ (note that a factor of g1 has been taken out in the last definition). 
The type IIA supergravity action can then schematically be written as 5^^^ = 5ns + 5^^ +^^0^' 
where the three contributions (in the "string frame" , see below) read: 

5ns = ^yd'°xV=9<!-2*(iS + 48„«8''*-i |if(3)|') (5.51) 



(5.52) 

J ^. . . . / 

'^"s' = -J^J ^(2) A F(4) A F(4) (5.53) 

Note that the NS part in the first line does not carry a superscript "type IIA" because, as 
we saw at the level of fundamental massless states in Section 5.1.3, type IIA and type IIB 



^"indices with primes run over eleven (not ten) dimensions in this Section, i.e.m! ,n' = 0, . . . , 10, while m,n = 
0,...,9. 

'^^Following our remarks about 5s being determined by the vacuum expectation value of the dilaton (Section 5.2), 
the compactification radius R is therefore again in principle a dynamical quantity. In this sense, one sometimes 
speaks fo the eleventh dimension "opening up" in the limit of strong (non-perturbative) coupling ^ 1- 
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superstring theory have the same NS-NS sector, therefore we wih equally use Eq. (5.51) in type 
IIB supergravity below. The prefactor e~^* of the Ricci scalar reminds us that these actions 
are written in the string frame, and "plain" ten-dimensional General Relativity is obtained by 
transforming to the Einstein frame, with the two metrics related by ^mn™^^ = e*/^^^. Re- 
writing the NS part of the action in the latter frame gives 

In the following, we drop the superscript "(E)" and always work in the Einstein frame where 
gravity takes its canonical form. 



5.3.2. Type IIB Supergravity 

The most prominent difference between type IIA and type IIB arises from the different R-R 
gauge potentials C(„) they contain, with vP-^ = 1,3 and vP^^ = 0,2,4. We already mentioned 
that the field strength F(5) = dC(4) then must be self-dual in ten dimensions. This is one of the 
obstructions to formulating a manifestly supersymmetric action for type IIB supergravity. One 
possible strategy is to work directly with the equations of motion, which can be written in a 
covariant way, and moreover may be obtained one from the other by supersymmetric variation. 
Alternatively, one may resort to writing an action which is not by itself invariant under local 
supersymmetry, but has to be supplemented with the self-duality constraint F^^-^ = *F(^5) on the 
gauge- invariant field strength combination, see below. (This allows the elimination of superfluous 
bosonic degrees of freedom on top of those needed to match the fermionic ones.) The NS part of 
the action is the same as in type IIA, see Eq. (5.54). Its other parts in the Einstein frame read 

^^"^ = -i/d"-V=.(e-|F(,)|^ + e*|F(3)f + ^|F(3)f), (5.55) 
Here the notation used is -F(n+i) = dC(„), H^^^ = diJ(2), and tildes indicate combinations 



-^(3) = -^(3) - C'(0)-f^(3) ) (5.57) 

1 „ „ 1 

2 



-^(5) = -^(5) - ^ C'(2) A ii"(3) -I- - -6(2) A F(3) . (5.58) 



On the solutions to the equations of motion derived from the complete type IIB action S^^^ = 
<Sns + + ^cs ' impose the self-duality condition for F(5) . 

Note that type IIB supergravity has two scalar fields, C(o) and the dilaton <I>, as well as two 
two- forms, the Kalb Ramond field i?(2) ^^^cl the R-R potential C(2)- It turns out that the 
action 5^^^ built from Eqs. (5.54), (5.55) and (5.56) has a symmetry under transformations by 
the special linear group SL{2,R), which is not manifest in the above notation. To make this 
symmetry explicit, one may regroup the dilaton and the C(o) field (often called the "axion") into 
the complex axion-dilaton field r, and assemble the two-form potentials into a two-component 
vector as in 

^ = ^(0) + ^^-*, i3(2) = ( ^J^^j ) . (5.59) 
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Note that the field strength "^(3) that fohows from i3(2) then also has two components. The type 
IIB action in the Einstein frame can then be rewritten as 



-HB 



2kio J 
8kio \J 



XV -9 

10 



i? - ^ Hjs) M n^^^ + ^ tr {^^'M d^M) 

2 



C(4) A n}-,, A n 



(3) 



'(3) 



(5.60) 



where the matrix M. and the field strength F5 are given by 

^(0) 



M = e 



\t\ 



-(0) 



1 



i^5 = i^5+2£u^(2)AHj3) 



(5.61) 



using the definitions of Eqs. (5.59). The manifestly S'L(2,R) invariant notation of Eq. (5.60) is 
useful to find solutions for all the bosonic background fields of type IIB supergravity. To this 
end and for later use, we also define the complex three-form 



(3) = ^(3) 



(3) 



(5.62) 



which is (in the applications we will consider) imaginary self-dual, i.e. it obeys *G(3) = iG^^y 
We consider the background equations of motion derived from this type IIB supergravity action 
in Section 7.3.1. 



In this Chapter, we established effective ten-dimensional supergravity actions as the low-energy 
limits of both type IIA and type IIB superstring theory. The fields gmn, Bmn, ^ as well as the 
R-R gauge potentials C(„) make up the bulk field content of these theories. There are, however, 
additional ingredients to the picture: for example, it is possible to embed hypersurfaces, called 
"p-branes", into the supergravity backgrounds. In the next Chapter, we study a particular 
class of branes, so-called Dirichlet or D-branes, for short. We also address the question of 
compactifying this Chapter's ten-dimensional actions down to the four-dimensional world of our 
everyday experience. 
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The bulk supergravity actions obtained in Chapter 5 do not cover the full range 
of string theoretic building blocks available for the construction of a unified theory. 
In this Chapter, we discuss non-perturbative string objects called D-branes. We also 
show how one can establish contact between the d = 10 picture of superstring theory 
and our low-energy world where we observe but four dimensions. This also sheds light 
on the relations ("dualities") between the five different formulations of superstring 
theory. 



6.1. D-Branes 

When we solved the equations of motion for open strings in Section 5.1.1 for the bosonic and in 
Section 5.1.2 for the superstring case, we only considered Neumann boundary conditions: they 
are reasonable because they conserve momentum at the string endpoints. It was later realized, 
however, that open strings can have Dirichlet boundary conditions provided that their ends in 
these dimensions are attached to dynamical objects called Dp-branes, where the "D" stands 
for Dirichlet, and p denotes the number of spatial dimensions of the brane, see Fig. 6.1. [In 
spacetime, D;j-branes then trace out a (p + 1) dimensional world volume, see Fig. 5.1.] The 
momentum flowing out of the string endpoints is conserved by the brane. Apart from the 
spacetime filling D-brane with p = d — 1, the presence of these objects breaks d-dimensional 
Poincare invariance. 

6.1.1. Chern Simons (Charge) Term 

Recall that we stated earlier the analogy between the electromagnetic world line coupling for 
charged point particles, q Jrp dx^ A^, and a coupling of the anti-symmetric Kalb Ramond tensor 
field Bran to the string world sheet with the ten-dimensional field -Bmn "pulled back" onto S. 
[Compare the pullback of the metric gmn in Eq. (5.2).] In this sense, fundamental strings are 
charged under the field B„in. It turns out that, analogously, D-branes carry charges correspond- 
ing to the R-R gauge potentials we encountered in the state spectra of type IIA/B superstring 
theories. There is a natural coupling between a R-R form C(„) with n = p + 1 indices and the 
world volume Sp+i of a Dp-brane, 

where the ^* (i = 0, . . . ,p) are the coordinates on the world volume of the brane. The index 
"CS" stands for "Chern Simons" , and the proportionality constant /ip will be explained shortly. 
To be precise, the gauge potentials entering into the Chern Simons term of D-brane actions may 
differ by field redefinitions from the ones in the bulk supergravity actions of Chapter 5. This is 
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Figure 6.1.: Left: D-branes are dynamical objects to which the ends of open strings can attach if they 
have Dirichlet boundary conditions. On the world volume of the branes, (supersymmetric) 
gauge theories can live, whose fields correspond to the masslcss modes of the open strings 
attached to the brane. The excitation spectrum for strings with both ends on the brane is 
tachyon-free. Closed strings can propagate the entire ten-dimensional bulk. Right: Branes 
interact through the exchange of closed string modes that penetrate the hulk, and open 
strings can also stretch between different branes. In a brane-anti-brane system, these strings 
have tachyonic modes and announce the onset of mutual brane annihilation into closed string 
annihilation. 



because the supergravity theories established previously with the use of symmetry arguments are 
unique up to those redefinitions, which means that the identification between the supergravity 
fields and the string modes is ambiguous [154]. 

Coupling as in Eq. (6.1) is called "electric" because of the analogy with (ix'^A^. Since the n-forms 
have n = 1,3 (i.e. odd) in type IIA and n = 0,2,4 (i.e. even) in type IIB superstring theory, 
wc immediately sec that p must be even in the former, and p odd in the latter case. The T)p- 
brancs then serve as (electric) sources for the C(j,_|_i) gauge potentials in each theory. Apart from 
Eq. (6.1), there is also "magnetic" coupling: in this case, the electromagnetic analogy is with 
the field strength oc d^Ai, — di,A^, which creates a flux through a two-dimensional surface 
S'^, as is described by Gauss' law. In d = 10 dimensions, the R-R field strength = dC(jj_i) 
creates a flux through a (d — 2 — n) = (8 — n) dimensional hypersurface, written as S^~^: 
therefore, a C(„_i) potential experiences electric coupling to Dp-branes with p = n — 2, and 
magnetic coupling to a brane with p = 8 — n. [From the point of view of the branes, a Dp-brane 
is an electric souce for the same potential for which a D(6 — p) brane is a magnetic source.] 
Mathematically, this is described by the Hodge duality or "star" operator * (see Appendix A 
and [155, 151]). The higher-dimensional generalization of Gauss' law then allows one to calculate 
the electric brane charge /ip from 

/xp = / *F{p+2) ■ (6.2) 

This charge, and the one of the magnetically sourcing D(6 — p)-brane, are subject to the Dirac 
quantization condition with jipjiQ-p G 27rZ. Note that in type IIB, the D3-brane is special 
because it carries a self-dual charge: the (five-index) field strength of the four-form potential 
F(5) = dC(4) must be self-dual in ten dimensions, F(5) = *i^(5). [Note that the type IIB bulk 
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supergravity action of Section 5.3.2 required a gauge-invariant redefiniton of this field strength, 
see Eq. (5.58).] 

Branes with p even in type IIA and p odd in type IIB are stable, and they preserve half of the 
supersymmctry of the background. [Branes of the other dimensionalities are unstable in the 
respective theory because there are no gauge potentials for them to couple to, see Eq. (6.1). 
If present, these branes break all of the supersymmetry.] If more than one brane is embedded 
into the same background, they can interact among themselves by exchanging (closed string) 
graviton and R-R modes. Gravity is described by the so-called Dirac Born Infcld (DBI) term of 
the brane world volume action, which we discuss in the next Section. In principle, closed string 
dilaton modes are also exchanged, but in the applications we consider, the dilaton is usually a 
constant, therefore we do not take them into account here. 

Like the repelling force between two particles of the same charge q, the R-R interaction tries 
to drive two Dp-branes (with charges fip) apart, but for parallel branes, the attractive gravita- 
tional force exactly balances this. (For the same reason, a single D-brane put in a supergravity 
background does not experience any force: its CS and the DBI terms have opposite signs, see 
below, and cancel exactly.) A system of D-branes only is therefore stable and also preserves half 
the supersymmctry. 

If anti-D-branes (with opposite R-R charge — /itp, written as Dp) are introduced, they attract D- 
branes and supersymmetry is broken. (Put a simple way, anti-D-branes can only be introduced 
into the theory at the quantum level, therefore one has necessarily left the realm of classical 
supergravity solutions obtained from the bosonic part of the action in Chapter 5.) Since for an 
anti-brane, the sign of the CS term is reversed, the cancellation with the DBI action vanishes 
for a an anti-brane: even by itself, an anti-brane in a supergravity background will feel a force 
and try to minimize its energy. Combinations of D- and D-branes are of particular interest in 
cosmology, as we shall see later. 



6.1.2. Dirac Born Infeld (Dynamical) Term 

A theory with D-branes must also contain open strings (that end on the branes) apart from 
closed ones, because a string loop hitting a D-brane can break into an open string with both 
ends attached to the brane, see Fig. 6.1. When the branes carry conserved charges [that is, if 
they are of a dimensionality p that is stable in the corresponding theory], one can show that 
there are no tachyons in the spectrum of open strings starting and ending on branes. 
On the world volume of D-branes, gauge theories can reside (which can possibly be non-abelian 
for multiple coincident branes, see below), and these theories have as much supersymmetry as 
the background after introduction of the branes. The fields in the spectrum of the gauge theory 
correspond to the massless modes of the open strings attached to them. Put another way, the 
dynamics of a Dp-brane are described by the {p + l)-dimensional effective theory of massless 
fields living on the brane. 



.5dbi = -Tp j dP+^e { - det [^ab + Bab + 27ra' Fab] ] ^'^ , 
where the dimensionful parameter Tp is the D-brane tension, where in our units 



(6.3) 



g,i2Trfa'^+^^/^ . (6.4) 



The subscript "DBI" in Eq. (6.3) stands for "Dirac Born Infeld", and the two-form B^2) is the 
pullback onto the brane of the two-form -B^„ in ten-dimensional spacetime, in straight analogy 
to the pullback of the metric 7a(,, compare Eq. (5.2). The second two- form F(2) arises from a 
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U{1) gauge potential associated with the strings with both ends on the brane. A derivation 
of this dynamical brane action can be found e.g. in [152, 151]. In the so-called static gauge, 
diffeomorphism symmetry is used to align the (p+l) world volume coordinates ^* with the first 
(p + l) components of the bosonic spacetime coordinates X"^, while the other {9—p) coordinates 
become scalar fields on the world volume of the brane describing transverse excitations, see 
Fig. 6.2. Very often, it is reasonable to consider the case of vanishing pullback 5(2) s-^id gauge 
field F(2) in Eq. (6.3). For completeness, however, let us note that if these two-forms are 
non-zero on a Dp-brane, this brane can also carry induced charges of D(p — 2n)-type, where 
n = 0, 1, . . . . In these cases, the Chern Simons term of the brane action instead of Eq. (6.1) is 
written as Sqs = f^p f [C exp{B + 27ra' F)]i^p_^^y with the {p + l)-dimensional piece extracted, 

and C = Yln=o ^{n)- 0-^ this sum it is understood that only the values of n permitted for the 
corresponding theory are counted.) A case of interest we will encounter later is a D5-brane, 
which, given its orientation in the background geometry (see below) also carries charges of D3- 
and Dl-brane type. Note that a Dl-brane, while geometrically a string ("D-string"), must not 
be confused with the fundamental strings of the theory ("F-strings"): the former is charged 
under the R-R two-form potential C(2) of type IIB theory, while the fundamental Fl-strings are 
charged under the NS-NS Kalb Ramond anti-symmetric tensor field -6(2)- 

In the right units and if the dilaton is constant, the charge and tension of a brane are equal in 
the Einstein frame (in the anti-brane case, up to a sign), Tp = =b/ip. This makes it possible for 
the CS and DBI terms to cancel for D-branes (hence they can be moved around in a supergravity 
background without energy cost), and add for anti-branes (which will seek to minimize their 
energy at a preferred position in the background). From Eq. (6.4), one sees that Tp cx g~^, 
meaning that Dp-branes become infinitely heavy (i.e. non-dynamical) in the weak coupling limit 
Qs ^ 0. In this sense, they are non-perturbative excitations of string theory. 




Figure 6.2.: Left: When D-branes are embedded into the higher-dimensional spacetime, their world 
volume directions can be aligned with certain coordinates of spacetime in the so-called 
static gauge. Right: In the spacetime directions orthogonal to its volume, the D-brane can 
oscillate since it is a dynamical object. 
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6.1.3. Other Extended Objects 



A complete ten-dimensional supergravity action describing the low-energy limit of type IIA/B 
superstring theory consists of the bulk terms derived in Sections 5.3.1 and 5.3.2, respectively, 
plus contributions of the form of Eqs. (6.1) and (6.3) for any type of (stable) Dp-brane allowed 
by the background. There exist, however, also a variety of other (non-pcrturbative) string 
theoretic objects that may be included, if certain certain restrictions are respected. Here, we 
only comment on so-called orientifold planes, denoted by Op, where p is again the number of 
spatial dimensions, and the rules for their dimensionality in type IIA/B are the same as for Dp 
branes. These Op-plancs carry charges opposite to the Dp-branes, but they are not dynamic, 
since they are basically defined by the points left invariant under the orientifold projections 
mentioned in the last Chapter. (Recall that Dp branes are surfaces on which fundamental 
strings can end, and therefore dynamic.) Other objects are e.g. "instantons" [or D(— l)-branes 
in type IIB theory], and they will occasionally be mentioned later on. A thorough discussion of 
extended string objects can be found, for example, in [154]. 



6.2. From Ten to Four Dimensions 



We so far accepted without bewilderment that a string theoretic spacetime can have more than 
four dimensions. The embedding of p-dimensional hypersurfaces like D-branes opens up the 
interesting possibility of identifying our observable Universe with e.^. the (3-|-l)-dimensional 
world volume of a D3-brane in type IIB superstring theory (or with a stack of coincident branes, 
which is phenomenologically attractive from the gauge theory point of view). Models like this 
are often called "brane world scenarios", see e.g. [156] for a review. The gauge theories of the 
Standard Model could then be described by the open strings attached to the brane (to which 
the SM interactions therefore are confined), while gravitons (closed string modes) propagate in 
the entire d = 10 spacetime. This suggests an intuitive explanation why gravity is much weaker 
than the other forces of Nature [157, 158]. 

However, setting aside the "brane world" approach for the moment, we can also obtain a four- 
dimensional theory from the d = 10 supergravity actions of Section 5.3 by compactifying six out 
of the nine spatial dimensions. Indeed, there is a precedent that the attempt to unify interac- 
tions may proceed using additional space dimensions, the so-called Kaluza Klein compactification 
procedure [159, 160]: starting from five-dimensional General Relativity with spacetime coordi- 
nates {t,x,y), one obtains in the effective four-dimensional theory the usual d = 4 gravity, but 
also a four- vector field (which maybe associated with the gauge field of electromagnetism), 
plus a scalar field (the ancestor of the string dilaton $). The additional dimension y is made 
compact, ie. one identifies it with a circle by setting y y + 2itR. All fields in the theory are 
then periodic functions of y, and may hence be developed in a Fourier series in this dimension, 
giving rise to the so-called Kaluza Klein modes. (Note that when, in search for the type HA 
supergravity action in Section 5.3.1, we applied dimensional reduction to eleven-dimensional 
supergravity, we only kept the zero-modes in this Fourier expansion.) At the quantum level, 
an infinite tower of additional energy states is found, with masses rriKK.n quantized in terms 
of the inverse radius 1/R of the extra dimension. If R is small (i.e. the compactification scale 
= 1/-R is large), the new energy levels lie far above those of the effective four-dimensional 
theory and go unnoticed in low energy experiments [149]. Let us also state the effect of a Kaluza 
Klein compactification on the Ricci scalar term of an Einstein frame supergravity action in d 
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dimensions, irip^^ J (M^x-^ R^'^\ which after compactification reads 

Vd-Am m^^2 j d^x^Ri^) + . . . . (6.5) 

By mpi^d we have denoted the Planck mass in d dimensions, and M is the {d — 4)-dimensional 
space into which all but four dimensions have been compactified. Its volume therefore enters into 
the (observed) four-dimensional Planck mass nipi as rripi = mp~^^ Vrf_4(M). The compactification 
(or Kaluza Klein) mass scale rric is just the inverse of the compact volume, i.e. we have in our 
units mf~^ = 1/V^-4(M). Using this relation, one can establish an upper bound on the size of 
the compact space since the tower of Kaluza Klein excitations in the particle spectrum has not 
been observed yet. It also follows that, for the four-dimensional Planck mass to be constant, the 
compactification volume must be fixed, which, as we show below, is usually non-trival in string 
theory. 

6.2.1. Internal and External Spaces 

In the d = 10 superstring theories, one has to compactify not one, but six spatial dimensions to 
(potentially) get to our observed four-dimensional world. In these remaining four dimensions, 
we wish to preserve Poincare invariance, therefore the ten-dimensional metric is split into a 
direct product of a (dext = 4) external and a (dint = 6) internal manifold, Mio = M4 x Mq. In 
the external space, the coordinates read x'^ (/U = 0, ... ,3), and on the compact internal space 
(M = 4, . . . , 9). The internal metric guN does not depend on the external coordinates, and 
we specify to Minkowski space with rj^i, in four dimensions, so that the ten-dimensional line 
element reads 

dslo = ri^u dx^ + mN{y) dy"^ dj/^ . (6.6) 

Note that, as a consequence, the ten-dimensional Ricci tensor Rmn splits into a separate external 
piece Rjxvi and an internal piece Rmn, while the mixed components vanish. Let us also remark 
that the requirement of maximal symmetry in the extended dimensions restricts the possible 
ansatze one can make for the other background fields such as -Bmn and the R-R gauge potentials: 
their field strength components with one or more indices along the extended dimensions vanish. 

In a straightforward generalization of Kaluza Klein, each of the six extra dimensions can be 
curled up in a circle, corresponding to toroidal compactification^. We consider the important 
example of the two-dimensional torus T below, but in general the resulting d = 4 theory after 
toroidal compactification has too much supersymmetry. For example, in the case of type IIB 
theory, one would obtain = 8 supersymmetry in four dimensions. Requesting that only 
a "reasonable" amount of supersymmetry be preserved in four spacetime dimensions (more 
precisely A/" = 2 for type IIA/B, and A/" = 1 for type I and heterotic theories) suggests a 
compactification on more complicated spaces known as Calabi Yau (CY) manifolds. (This can 
be shown by imposing that the internal manifold admits a covariantly constant spinor.) In 
general, CY compactifications preserve 3/4 of the original supersymmetry: the original M = 8 
for type IIB is reduced to A/" = 2, which may be broken down further to A/" = 1 e.g. by adding 
D-brancs to the background. A vast selection of CY manifolds with three complex (hence six 
real) dimensions exists [161], and the choice made to compactify a given theory is of crucial 
importance for the resulting four-dimensional physics at low energies: as mentioned earlier, 
dimensionless physical parameters in string theory are determined by the expectation values of 

Another simple idea is to compactify on "orbifolds" (i.e. manifolds with certain points taken out) because the 
equations of motion for the superstring Eq. (5.30) then stay very simple. 
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Figure 6.3.: Left: The six internal dimensions of a ten-dimensional supergravity description of string 



theory are compactified on a Calabi Yau manifold, which breaks 3/4 of the original su- 
persymmetry in the background. CY manifolds are complex manifolds with coordinate 
zi,Z2,Z3 and a Kahler metric. Right: More general compactification schemes arc possible, 
i.e. one may include a warp factor into the ten-dimensional metric. As we show later, these 
geometries typically have the form of a cone. The metric on these (non-compacts) cones 
is usually known, but they have to be attached to a (compact) CY bulk manifold at their 
UV end, to make the total compactification volume (and hence the Planck constant in four 
dimensions) a constant. 



scalar fields, and these expectation values in turn are a consequence of the particular higher- 
dimensional geometry. One may hence schematically write »SiO(i[C] — > S^d, where C stands for 

the chosen compactification [162]. Moreover, the supergravity approximation is justified when 
the size of the compact internal manifold is large compared to the string scale, i.e. when the 
mass is much smaller than m^. 



6.2.2. Calabi Yau Manifolds 

A Calabi Yau n-fold is a Kdhler manifold in n complex dimensions with SU{n) holonomy (see 
Appendix A). In one complex dimension, these are the complex plane C (which is non-compact), 
and the torus T (which is compact). For n = 3, many thousand CY manifolds are known, and 
their number may even be infinite. A compact Calabi Yau n-fold is Ricci flat, i.c-Rmn = 
(here, M,N are indices running over the 2n real coordinates). If the CY has a metric, the 
Beta numbers b^, k = 0, . . . , n count the number of linearly independent harmonic p- forms on 
the manifold. For Kahler manifolds, the Betti numbers can be decomposed in terms of the 
Hodge numbers hP'"^ which give the number of harmonic (p, g)-forms (see Appendix A.2), with 
bk = Yl^=o hP''^~^- Note that there is no requirement to know the exact form of the metric on 
the Calabi Yau; indeed it is not known for any non-trivial compact example. 
One can characterize Calabi Yau manifolds topologically by their Hodge numbers /i^''^, where 
p,q = 0, . . . , n, and hP''^ = h'^'P from complex conjugation. One also has h^'*^ = from 
Poincare duality. Since for any complex manifold = 1 (which means that constant functions 
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can exist on the manifold), and h^'^ = for simply-connected manifolds^, the characterization 
of the n = 3 Calabi Yau manifold only requires h^'^ and /i^'^. (This is illustrated by grouping 
the Hodge numbers together in the Hodge diamond for CY n-folds, see e.g. [155, 151]. Note that 
h^'^ = 1 by Poincare duality.) The Euler characteristic [compare Eq. (5.47)] of the CY then is 

However, CY manifolds with specified Hodge numbers are not unique: inequivalent Calabi Yaus 
can have the same h^'^ and /i^'^, and be smoothly related among themselves by deformations of 
the parameters characterizing their size and shape, the so-called (scalar) "moduli fields" . The 
moduli fields with their respective possible values span the "moduli space" of parameters that 
may be changed without affecting the topology. In Eq. (6.6), it would therefore be more accurate 
to write (?„m(y,cj), indicating that the metric on the internal space is not only a function of the 
coordinates y, but also of all other parameters lo required to fix the geometry completely [9]. 
Once compactified to four dimensions (ie. at energies below the compactifiction scale m^), we 
must expect a massless Ad scalar field for each modulus of the extra-dimensional metric (and 
more from the compactification of the (7(p) forms in the respective theory). The fact that they 
are massless means that these fields do not enter into the effective scalar potential (at lowest 
order at least), and have a priori no reason to be stabilized at a given value. 



6.2.3. Warped Compactifications 

Compactification on compact Calabi Yau manifolds is consistent with the background fields 
and R-R gauge potentials (Bmn and C(p)) in the bulk supergravity actions of Section 5.3 set to 
zero. Note, however, that a supergravity background can include non-perturbative objects like 
D-branes. This has two important consequences: firstly, these objects source the R-R gauge 
potentials (a Dp is an electric source of C(p_|_i)), which (along with their associated "fluxes", 
i.e. the field strengths -F(p-|-2)) should then have non-zero background values. The fluxes are often 
said to "thread" cycles of the Mg manifold. (Recall that the gauge potentials' held strengths 
are forbidden to have "legs" along the extended dimensions by the requirement of maximal 
symmetry.) If some of the p spatial dimensions of a Dp-brane lie along the internal manifold 
Me, the branc is "wrapped" along the corresponding dimension (in type IIB, this will be the case 
for D-branes with p = 5,7,9 dimensions^). This is precisely a case where e.^. the induced i?(2) 
field that enters into the DBI brane action (6.3) no longer vanishes. As we mentioned in Section 
6.1.2, such a brane then carries also induced charges of lower p-type: for example, a D5-brane 
with two wrapped dimensions, it then also contributes a so-called "fractional" D3-charge. 
Secondly, Dp-branes embedded into the ten spacetime dimensions give rise to additional scalar 
moduli fields associated with their position within the internal manifold (see Figure 6.2), on top 
of the geometric moduli ui that arise from the choice of compactification. (Note that D-branes 
do not have world volume moduli because they minimize their energy at special positions in 
the background.) We shall see in the next Chapter that it is of crucial importance in string- 
cosmological model building that (almost all) moduli be fixed. Since our ultimate goal is to study 
models of inflation derived from string theory, let us rephrase this condition in "inflationary" 
terminology: the moduli fields (or at least most of them) should be heavy compared to the 
Hubble scale, rn^^^^ 3> H^, leaving only a manageable number of dynamic scalar fields during 
inflation. 



^The torus, which we consider in Section 7.1.1 as an example of two-dimensional compactification, is not simply 

connected and has h^'° = hP'^ = 1. 
^The D9-brane is not dynamical because it is spacetime-fiUing in ten dimensions. 
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The metric ansatz of Eq. (6.6) therefore must be generahzed to a larger class of compactification 
manifolds allowing for the presence of branes and non-zero background values for NS-NS and R- 
R fields. The ten-dimensional space in this case is a so-called "warped" product of a Minkowski 
space and an internal manifold. Its metric can be written as 

ds?o = h-^'\y) rj^, dx^^ Ax^ + h^/\y) guNiv) dy^ dy^ , (6.7) 

where the function h{y) (which depends on the internal coordinates y^) is called the warp 
factor. The internal manifold with metric gMN{y) is no longer necessarily Calabi Yau and can 
be non-compact. (It does not even have to be a Kahler or a complex manifold.) A well-known 
example of a non-compact CY used in "flux compactifications" is the so-called conifold we study 
below. In this particular case, even the explicit metric on the manifold is known. 
There are two main advantages in choosing a ten-dimensional metric of the form of Eq. (6.7) over 
one like Eq. (6.6). Firstly, the warp factor h{y) [which can have a very strong, i. e. exponential 
dependence on (proper) distances within the internal manifold] can be used to explain the hier- 
archy between the strength of physical interactions. This is analogous to the Randall Sundrum 
models [157, 158] (the first of the "brane world" scenarios mentioned earlier), apart from the 
fact that these live in five instead of ten dimensions. Secondly, as it will be discussed at length 
in later Chapters, the presence of fluxes allows for the stabilization of certain moduli fields, a 
feature much sought after in models of string cosmology. In particular, the non-compact conifold 
mentioned above comes in different varieties (called simple, deformed or resolved) depending on 
which of its internal cycles (if any) is threaded by fluxes. Intuitively, the moduli stabilization 
mechanism provided by fluxes (or branes wrapping internal cycles) may be understood as fol- 
lows: without fluxes, cycles in the internal manifold can be deformed at will without energy 
cost. If there is a flux along the cycle, changing its size modifies the energy distribution of the 
background, and the cycle is therefore stabilized at the position of least energy. In terms of the 
effective scalar potential of the lower-dimensional theory, it has acquired a mass. 
The conifold example, however, also points to a major problem of warped compactifications: 
the manifolds employed can be non-compact. At the technical level, this means that at least the 
modulus field corresponding to the overall compactification volume V% remains unfixed. This 
is problematic because Ve determines the four-dimensional Planck scale via Eq. (6.5). There- 
fore, the metric (6.7) is usually considered to be valid only in some "corner" of the overall 
six-dimensional space, and should be glued smoothly into a compact Calabi Yau bulk of the 
form (6.6) somewhere. On the one hand, this "patchwork" structure of the compactified dimen- 
sions is useful in string theoretic model building bccaTisc one can attach different corners with 
e.(/. different warp factors hi and /i2 [as well as different local metrics fi'^^^fjv' •S'mjv] i^-"^) 
the same bulk. One of these then might prodiicc inflation in the early Universe, while another 
one can contain the Standard Model of particle physics. On the other hand, the metric of the 
compact bulk connecting the two corners is unknown. 



6.3. Duality Relations 

In Section 5.1.3 we discussed the five superstring theories and mentioned that they are considered 
as different realizations of one underlying framework. This hope is fueled by the fact that, as 
indicated in Fig. 5.2, the distinct formulations can be obtained one from the other by certain 
transformations, on which we now comment. 

T-DUALITY 

The "T" refers to "target space" , because under this duality, compactification of a dimension 
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on a circle with radius R is exchanged with compactification on R = a' / R. (The string scale is 
often set to unity, a' = 1, hence T-duality takes R ^ The resulting physics in each case 

is identical, which illustrates the fact that spacetime geometry is "seen" differently by extended 
objects like strings than it would be by point particles. 

A T-duality transformation takes the two type II string theories one into the other, as well as the 
two heterotic theories. (If more than one direction is toroidially compactified, an even number of 
transformations gives back the same theory on the dual torus.) More precisely, R and R are the 
two limits of a continuous change of the compactification radius, which is a dynamical quantity. 
Note that in this sense, the string coupling gs measures the compactification radius of the 
eleventh dimension that was curled up (with gg = a/c7 R) to obtain type IIA superstring theory 
from d = 11 supergravity. T-duality moreover justifies the existence of D-branes as fundamental 
objects of string theory because it can be shown that by taking R — t- a' /R, Neumann boundary 
conditions in the direction of the compactified dimension map into Dirichlet boundary conditions. 

S'-DUALITY 

This duality relates the limits of small and strong string coupling by exchanging gs — )• 1/gs- 
Again, recall that is fixed by the vacuum expectation value of the dilaton^, g^ = e*°, therefore 
S'-duality is a field transformation taking the dilaton $ ^ — $ (and is in this sense again 
continuous). Remarkably, it is possible to study this duality at the level of the supergravity 
effective actions for the five theories (of which we only discussed the type IIA and type IIB cases 
in Section 5.3). This can be understood as supersymmetry "protecting" string quantities as the 
extrapolation from weak to strong coupling is carried out. iS-duality relates type I to SO{32) 
heterotic theory, while type IIB is self-dual under it. When type IIA theory is taken to the 
strong coupling limit, it "grows" an eleventh dimension (see above), leading to the low-energy 
supergravity limit of M-theory. 

Mirror symmetry and geometric transitions 

Mirror symmetry is a property relating certain Calabi Yau manifolds, and is most easily under- 
stood as an analogue of T-duality: two CY three-folds V and W have mirror symmetry^ if their 
cohomology groups (see Appendix A) satisfy Jf^'^C^) = (W) . As a consequence, their 

Hodge numbers are interchanged, i.e. /i^'^(y) = ^^'■'"('W), and vice versa. It turns out that type 
IIA theory compactified on ^ describes the same physics as type IIB compactified on W. Their 
moduli spaces then are the same, and we come back to this in the next Chapter. (It is, however, 
possible that two CY manifolds have the same Hodge numbers, yet disjunct moduli spaces.) 
Going beyond the supergravity approximation, one can also have "geometric transitions" , which 
describe a smooth change in topology. These can even affect the Hodge numbers of manifolds, 
and one example is the "conifold transition" relating the singular, deformed and resolved coni- 
folds we encountered earlier in the discussion of warped compactifications. Geometric transitions 
establish a link between backgrounds that contain D-branes (i.e. sources of C(„) gauge poten- 
tials), and warped backgrounds with fluxes only {i.e. with non-zero field strength = dC(-„), 
but no localized sources). This allows both to understand the presence and the quantization of 
flux in the latter backgrounds: the sources of the flux are explicit in the dual background, where 
they come in integer units because of the Dirac condition. 

We therefore made it plausible that the five different formulations of superstring theory are 
related by duality transformations and describe but one underlying framework. However, it turns 
out that there is a vast number of possible compactifications from ten down to four dimensions, 
both on Calabi Yau manifolds and on more general "warped" backgrounds. Each of these gives 

^The dilaton, together with its superpartner the axion, belongs to a chiral superfield usually denoted by S. 
®In the Hodge diamond, this looks like a mirror transformation along its central axis. 
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rise to a different string theory vacuum (in which, at this point of our discussion, supersymmetry 
in four dimensions is still conserved). Therefore, we may have a unique theory, but infinitely 
many realizations of low-energy physics depending on the properties of each vacuum. [Note that 
this situation is not unlike General Relativity, where the Einstein equations also admit many 
solutions (including higher dimensional ones) that do not describe our Universe. In the context 
of cosmology, arguments of simplicity and symmetry help us to select a solution.] In the context 
of string theory, this is often referred to as the "landscape" [163, 164]. In the absence of a good 
criterion^ for choosing one among these vacua, one can consider the statistical probability for 
finding e.g. a positive cosmological constant in one of them [165, 166, 167]. 
With the elementary and advanced string theoretic tools of Chapter 5 and the present one in 
hand, we are ready to embark on the adventure of searching for string cosmological models. 
Before we proceed, let us add both a general caveat and a precise motivation. It is partly by 
construction that inflation hides from our view most of its GUT scale origin because only a small 
window of scales (leaving the Hubble horizon towards the end of inflation) is accessible. The 
slow roll mechanism further reduces information to very few generic parameters constrained 
from observations. This makes detailed knowledge of the inflaton's interactions an accessory 
detail rather than a necessity. From a cosmologist's point of view, one may therefore ask what 
is to be gained from finding inflation's underlying theory - apart from intellectual satisfaction? 
Among the reasons to continue and extend the search for the (string theoretic or other) origin 
of the inflaton, let us cite the following two [9]: we will see that e.g. the geometric interpretation 
of stringy inflaton candidates lends justification to seemingly arbitrary quantities like the range 
of field values. On the other hand, since string theory is also aimed at describing the Standard 
Model of particle physics, it offers the hope of a complete understanding of reheating, since both 
ends of the theory (the inflaton and the degrees of freedom after inflation) are known. 



other than our own existence, an argument known as the "anthropic principle" 
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So far, we encountered several classes of scalars fields in inflation, among them 
the vast amount of moduli fields describing the (compactification of) the higher- 
dimensional geometry. In this Chapter we take a closer look at moduli fields, which 
can be of the complex structure or the Kdhler structure type, and lay the groundwork 
for understanding their respective stabilization mechanisms. 



7.1. Moduli Space 

We mentioned in Section 6.2 that the ten-dimensional supcrgravity formulations of string theory, 
when compactified down to d = 4 on Calabi Yau three-folds, lead to an effective theory with 
a large number of massless scalars. These fields span the so-called moduli space and describe 
the size and shape of the compactification geometry. More precisely, moduli fields come in two 
different varieties, those related to the complex structure of the compactification, and the so- 
called Kdhler structure moduli. An important property of moduli space is that it is (locally) a 
direct product of these two components, Mmod = x 3VCk. For a CY manifold y , the dimension 
of the two parts is related to the Betti number 62 and 63 of the CY by dim(Mc) = (&3/2) — 1 and 
dim(MK) = 62 [168]. In terms of Hodge numbers, dimc(Mc) = /i^'^(y) and dimc(MK) = h^'^iV), 
where dime is the complex dimension [151]. It follows that under the operation of mirror 
symmetry, discussed in Section 6.3, the two parts are exchanged, but the two mirror manifolds 
have the same product moduli space Mmod- 

Moving from a point Pi in moduli space to P2 corresponds to a continuous deformation of the 
parameters of the CY manifold Vi into those of a second Calabi Yau ^2 with the same Hodge 
numbers (but interchanged) . This means that the topology is not affected because the two Euler 
characteristics are related by xi^i) = —xi^2)- Fluctuations around a given CY Vi therefore 
parametrize ways in which the geometry can be deformed without changing the topology, and 
one may (at least locally) use them to define a metric on moduli space^. Because of the local 
product structure Mmod = 3Vtc x Mk of moduli space, one may study separately both the 
complex structure and the Kahler structure deformations around their values for a given CY. 



7.1.1. Compactification on the Torus 

As an example, it is instructive to consider compactifying two dimensions in bosonic string theory 
on a torus T, which is a compact CY one-fold with Hodge numbers h^'^ = h^'^ = h^'^ = h^'^ = 1 
and Euler characteristic xC^) = 0- We first focus on deformations of the complex structure 
moduli space only: the scalar components arising from the metric gmn after compactification of 

'^Note that this should not be confused with the metric on a given Calabi Yau manifold, which, as we mentioned 
earlier, is not known for non-trivial compact examples. 
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two dimensions {x, y) are 



' V^et 5 = 1, r = ri + ZT2 = \-i — , (7.1) 



512 522 J 922 522 



i.e. after fixing the overall volume (by setting y/Sstg = 1), there are two real parameters left. 
They can be combined into the so-called complex structure r of T. The metric on the torus can 
then be written as 

d4 = - [ (rf + r|) dx^ + 2ti dx dy + dy^l , (7.2) 

T2 

and one can define local complex coordinates dz = dy + rdx (and the complex conjugate), in 
terms of which the torus line element (7.2) reads dsj = 2gzzdzdz with Qzz = l/2r2. [The 
(dz, dz) define a basis of holomorphic (anti-holomorphic) one-forms on T, see Appendix A.] 
It can be shown (see e.g. [151]) that the complex structure moduli space of the torus (param- 
eterized by r) with one complex dimension is itself a Kahler manifold, which means that its 
metric Grf is hermitian {Gtt = Gff = 0) and admits a Kahler potential (see Appendix A) 

K{t, r) = - log (^iJdzA dz^ = - log (2r2) , Grf = drdfK = ^ . (7.3) 

Therefore, the line element on the complex structure moduli space is ds^ = drdr/2T|. Note 
that we have held the total volume of the torus fixed by setting ^/detg = 1. If we drop this 
restriction, there is additional scalar degree of freedom (the "radial modulus" ) in the total moduli 
space of 7. 

We know that the spectrum of the bosonic string at the massless level contains the Kalb Ramond 
anti-symmetric tensor field on top of the metric gmn- Let us see what moduli arise after 
toroidal compactification when both of these fields are considered. A priori, the resulting scalar 
components after compactification now are 

i.e. there are four real parameters, which can be redefined as 

512 , . Vdetg 

T = TI+IT2 = hi , (7.5) 

522 522 

P = Pi + ip2 = Bi2 + i V^det 3 . (7.6) 

As before r parametrizes the complex structure moduli space in this example, and since p2 = 
yjdei g, lm(p) is a measure for the total volume of the torus. Let us simplify to a rectangular 
torus (with radii Ri and R2 in the respective directions) for the moment. Then we have gi2 = 0, 
and the diagonal elements of g are given by gn = Rf, 522 = -Ri- Therefore the complex structure 
parameter is calculated as r = i {R1/R2), and p = B12 + iRiR2- 

We know from Section 6.3 that mirror symmetry exchanges the complex and the Kahler structure 
parts of moduli space, i.e. here it takes^ r -H- p. Therefore it follows that p must be purely 
imaginary if r is, and B12 = in this case. However, for general tori at an angle, i.e. with 
off'-diagonal metric elements gi2 7^ 0, there is a non-zero component B12, hence the fields g and 
B must be considered together. 



^On the torus, mirror symmetry is just T-duality, which exchanges J?i,2 — > i/Ri,2- 
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Figure 7.1.: Left: A torus with its two dual cycles A and 23, with radii i?i and R2. Top right: With the 
overall volume of the torus is kept constant, one can still change the radii of the cycles A and 
23 without changing the topology: the radii are moduli fields for the simple toroidal com- 
pactification. Bottom right: A symbolic representation of a Calabi Yau manifold typically 
used to compactify the internal spaces of superstring theory. Such a manifold can have 
many hundred moduli fields describing the size and shape. (Figure from Simple English 
Wikipedia) 



7.1.2. Moduli From Calabi Yau Three-Folds 



For compactification of the dint = 6 internal dimensions of superstring theory on a Calabi Yau 
manifold y, we again start with a simplified discussion of metric fluctuations 5 gab (though we 
know that the components of the Kalb Ramond field must be considered on an equal footing). 
These fluctuations parametrize the complex structure part Mc of moduli space. For a Calabi 
Yau three-fold, we are interested in deformations under which the topological property of Ricci 
flatness in the extra dimensions is preserved, i.e. one imposes both RABid) = and RAB^g + 
6g) = for indices A, B running over the internal space. (One additionally demands that both 
the perturbed and unperturbed metric are Kahler manifolds.) This leads to the Lichnerowicz 
equation for the perturbations Sgmm in which the ten-dimensional operator Aiqj^ appears. 
The existence for (possibly many hundred) moduli fields in four dimensions is seen when the 
Lichnerowicz equation is decomposed into the four-dimensional external and the six-dimensional 
internal piece. Metric fluctuations SgABi^^, y^) in the extra dimensions only then can be written 
as 

SgABix'^, y'') = Yl h^siy"') , (7.7) 
it 

where hAB{y^) are the tensor eigenfunctions of the Lichnerowitz operator in six dimensions 
Agd, with A6d/iAB(y*^) = —fn\hAB{y^^)- It then follows from the ten-dimensional equations 
that the mode functions (j)^ (which only live in four dimensions, i.e. they depend on the x^^) must 
satisfy (^4^ — m^) (/>fc = 0, which is the equation of motion of a scalar field in four dimensions 
with mass m^. Moduli fields are those modes for which the mass vanishes, i.e. a deformation of 
the Einstein equations in their direction comes without energy cost. 

It can be shown, however, that under T-duality/mirror symmetry, the fields gmn and -Bmn niix 
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(as it was the case for r and p in the simple example of the two-dimensional torus), hence at the 
same time as the perturbations dgmn-, one has to consider SBmn- Using the strategy of perturbing 
around a given Calabi Yau V to obtain the metric on moduli space Mmod = x ^K, one writes 
the most general variation of the moduli space line element as 

d^mod = / d'a^Vy [Sgac Sg-u + {^Qad^gcl " ^B^d^Bj,)] . (7.8) 

By V6(y), we denote the volume of the Calabi Yau three-fold. Note that the background metric 

elements gac and g'gj vanish because V is a Kahler manifold, but the perturbations in these 
components have to be considered. The line element (7.8) is then rewritten in terms of the 
fundamental (p, g)-forms admitted by the CY (which are counted by the Hodge numbers W'^). 
Luckily, the exact form of the Calabi Yau metric gmn itself is not needed. One finally finds that 
both components of moduli space, the complex structure part Mc and the Kahlcr structure 
moduli space Mk, are themselves Kahler manifods with Kahler potentials [compare Eq. (7.3) 
for 7\ [151] 

K^'^ = -log(i[ nAn) , K^'^ = -\og(^[ JAJAJ). (7.9) 

V J Me ) V 3 J Me / 

Here, the superscripts "2,1" and "1,1" translate the fact that the dimension of the respective 
component is given by the Hodge numbers /i^'^(y) and /i^'^(y), respectively. By we denote 
the (3,0)-form on V (which is unique up to a prefactor, and the wedge product A gives the 
volume), and J is the so-called Kahler form of V (see Appendix A). It can be shown that also the 
K^"^ potential can be expressed in terms of the volume, with exp(— i^-^'^) = | /"^^ J AJ AJ = 8 Vq. 
Note that via J, the second potential K^'^ contains the Kahler modulus field p, to which we 
return in Chapter 9. 



Integrating out heavy fields 

Let us return to a schematic notation and assume that we can expand each scalar field (^(x™, y^) 
arising from a higher dimensional p-form or the ten-dimensional metric as 

^(x^ y^) = ^m(x^) /m(2/"") , (7.10) 
m 

where the fmiv^^) are complete set of eigenfunctions of the appropriate wave operator^ on the 
internal space Me. In the ten-dimensional actions of Section 5.3, the full Lagrangian Ciq^ is 
integrated over all ten spacetime dimensions. To obtain the Lagrangian in four dimensions, let 
us integrate only over the internal coordinates, C^ai'Pmix'^)] = / <i^y ^iod['f{x^:y'^)]- This still 
depends on all ipm{x^^) regardless of their mass. To identify Lagrangian for the four-dimensional 
moduli fields (which are by definition massless), we split (p{x*^,y^) up into light (ie. massless) 
and heavy contributions, depending on whether they have a zero or non-vanishing eigenvalue 
under the internal wave operator. The heavy modes are integrated out, but they cannot 

^Lct us illustrate this on the example of the torus T again: in this case, the internal operator A is two-dimensional 
and in terms of the vector "counterparts" dz = (i/2r2) [f{d/dx) — (d/dy)] and dz of the one-forms {dz,dz), 
it is written as A = —2dzdz. (Note that we drop the restriction ^detg = 1-) The eigenfunctions and -values 
of this operator are 

27r^ 

frn,n{x,y)=e-xp[2m(ny + rny)] , Xm,n = {m - Tn){m - rn) . (7-11) 

'^2 
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simply be "set to zero". Instead, one must cxtremize the action with respect to them while 
holding the light fields fixed. After integrating over the internal manifold, the heavy fields in 
the expansion (7.10) contribute to the effective potential in the resulting dext = 4 theory, but 
they are not dynamic. The light modes, called moduli, do not enter into the potential, but have 
(potentially non-canonical) kinetic terms in the four-dimensional Lagrangian. 
The choice to compactify on Calabi Yau manifolds was motivated by obtaining a supersymmetric 
four-dimensional theory and we saw earlier in Section 3.2.3 that a (chiral) superfield Lagrangian 
is entirely fixed by prescribing a superpotential W (a holomorphic function, which, in our present 
notation means that it depends on the (pm only, and not on their complex conjugates) and a 
general function K (the Kahler potential, which can contain as well as 4>m)- The remaining 
step from superstring theory towards a four-dimensional Lagrangian of the familiar supergravity 
form therefore consists in determining W{(l)m) and K{(f)m, (pm) after the compactification. Note 
that we have just completed this step for the latter of the two functions: we obtained the Kahler 
potentials for both the complex and the Kahler structure moduli in Eqs. (7.9). We will see 
how the superpotential W is calculated in concrete backgrounds below. Before we turn to the 
question of moduli stabilization, we now briefly list the moduli field content of type IIA and 
type IIB supergravity. 

7.1.3. Moduli of Type IIA/B Superstring Theory 

The field content of the two theories in ten dimensions was listed in Sections 5.3.1 and 5.3.2, and 
we know that compactification on a Calabi Yau manifold in this case leads to dext = 4, = 2 
supersymmetry. The ten-dimensional metric gmn as well as the -B^„ field and the R-R gauge 
potentials give rise scalar zero modes upon compactification, and (since the lower-dimensional 
theory is still supersymmetric) these moduli fields belong to supergravity vector and chiral mul- 
tiplets analogous to the one discussed in Section 3.1.3. 

For type IIA superstring theory, these four-dimensional fields are /i^'^(y) abelian vector multi- 
plets and /i^'^(y) -|- 1 hypermultiplets in the four-dimensional theory. (The product form of the 
moduli space inhibits mixiing between these sets of moduli). Each vector multiplet gives rise 
to two real scalar fields, therefore the (real) dimension of the Kahler structure moduli space is 
dim(MK) = 2h^'^. Each hypermultiplet contains four real scalar fields, so dim(Mc) = 4(/i^'^-|-l). 
For type IIB, these dimensionalities are reversed: there are h^'^ abelian vector multiplets and 
h^'^ + 1 hypermultiplets. The additional moduli fields from the "+1" in both theories are due to 
the dilaton (and another axionic partner which comes from the four-dimensional Poincare dual 
of the two-form i?^,/). Note that the overall volume modulus corresponding to Vq{'^) always 
survives down to the four-dimensional theory because of the scale-invariance of the supergravity 
equations of motion. (This modulus is sometimes called the "breathing mode".) If there are 
non-perturbative objects such as D-branes present in the background geometry, they will give 
rise to additional moduli fields. 



7.2. Stabilization Techniques 

We saw that even the simplest compactifications of superstring theory leave us with a prolifer- 
ation of complex scalar fields in four dimensions. By definition they do not contribute to 
the effective potential V in four dimensions at lowest order because a variation of the Einstein 
equations in the direction of a modulus has no energy cost. Therefore, the moduli fields generi- 
cally are unfixed and not stabilized to a particular vacuum expectation value (which would make 
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them massive) at tree level. It is expected that loop corrections, supersymmetry breaking and 
non-perturbative effects generate a potential for moduli fields, which is typically very shallow. 
This description makes moduli fields sound like cut-out candidates for inflation. However, the 
obstacle to this is that for slow-roll inflation, the potential must be nearly flat in its steepest 
direction. In multidimensional moduli space this means that one must know the correction- 
induced dependence of V on all moduli fields at higher order before one can tell whether a given 
modulus can serve as an inflaton. Any mechanism that lifts the "flat directions" of the other 
moduli in the potential generically will also lift the inflaton flat direction. In the terminology 
of Section 7.1.2, one might say that integrating over the "heavy" fields among the modes of 
Eq. (7.10) produces contributions to the potential for the remaining "light" degrees of freedom, 
which can render their potential directions too steep for inflation. It was therefore not until 
significant progress was made in the direction of "moduli stabilization", dramatically reducing 
the number of dynamical fields in the four-dimensional theory, that inflationary model building 
in string theory could proceed successfully. 

Above, we argued that flux compactifications may allow to fix most or all of the complex struc- 
ture moduli: the fluxes threading internal cycles bestow an energy cost on the variation of 
previously unfixed geometric parameters. In the next section, we lay the ground work for under- 
standing the mechanism of flux stabilization techniques. However, the Kahler structure moduli 
still remain massless after turning on fluxes. Their stabilization requires non-perturbative ef- 
fects, and we come back to them in Chapter 9. 

Two examples of moduli of special importance arc the dilaton $ and the Kahler structure mod- 
ulus p fixing the overall size of the compactification manifold. Note that they parametrize the 
validity of the string loop and supergravity approximations, respectively. These expansions rely 
on (7s < 1 and < rUs, ie. the compactification manifold should be large with respect to the 
fundamental string scale. As long as $ and p are massless, they can be tuned to arbitrary preci- 
sion. In the presence of fluxes, however, the stabilization of the other moduli is usually bought 
at the prize of a certain loss of control over the Qs and supergravity expansions. Remarkably, 
there exists a fiux compactification solution in which the supergravity expansion is justified in 
the S> 1 limit, where N is the amount of fiux turned on. Moreover, in this solution, the 
dilaton $ is constant because it is "protected" by the remaining supersymmetry of the theory. 

Proceeding from here is difficult because "everything happens at once", and all steps in the 
stabilization process are closely intertwined. We therefore take a step back and return to the 
original ten-dimensional supergravity action: by deriving the properties of fluxes enforced by the 
equations of motion derived from this action, we develop an understanding for the construction 
of concrete flux compactifications in the next Chapter. Because it is the best understood, we 
shall henceforth focus on the case of type IIB supergravity. (Usually, the type HA dual to the 
constructions below is also known.) We postpone the stabilization of the "second half" of moduli 
space, i.e. the Kahler structure moduli such as the total volume, until Chapter 9. 



7.3. Ten-dimensional Solutions of Type IIB Supergravity 
7.3.1. Equations of Motion 

To understand the rationale behind the scenarios we study later, it is useful to go back to the 
type IIB supergravity action and explore the consequences of its equations of motion as well as 
those of the self-duality condition of F5. For the two scalars $ and Co, the equations of motion 
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derived from Eq. (5.60) are 

V„V-$ + ^e-*|//(3)p-e2*|F(i)p-ie*|F(3)|2 = 0, (7.12) 
V„.V™qo) + 2V^^>V"C(o)-e*(F(3)i/(3) + |^^3^|2q^^^ ^ q (713) 

In terms of the combined fields r and the three-form G(3) defined in Eqs. (5.59) and (5.62), these 
can be combined into the equation 

V„V"^T = -ie* \dmT\^ - e^G(3)G(3) , (7.14) 

where the last factor of G(3) is not a complex conjugate. In the following, the axion field C(o) is 
often be set identically to zero. From Eq. (7.13) we then see that in this case 

F(3)i7(3) = o, (7.15) 

i.e. the two three-form field strengths are orthogonal to each other. Recall that we know from 
the eventual splitting of the metric into internal and external pieces M4 x Me, with maximal 
symmetry preserved in the four extended dimensions, that fluxes must not have "legs" in the 
external M4. Therefore, the indices in Eq. (7.15) run over the internal manifold only. For 
vanishing C(q^ and constant dilaton field, Eq. (7.12), we also find that [note that F3 = F3 for 
C(o) = 0, see Eq. (5.57)] 

|iJ(3)|2 = e^*|F(3)r (7.16) 

Next, let us investigate what we can learn from the intrinsic self-duality of the five- form flux 
F(5). We can make an ansatz with "built in" self-duality as 

F(5) = (1 *) da{y) A dx° A dx^ A dx^ A dx^ , (7.17) 

where a{y) is a function of the extra dimensions only, and the "*" operator is the Hedge dual in 
ten dimensions. Recall that F(5) was defined as the gauge-invariant combination of Eq. (5.58), 
and that dF(-5) = d^B(3) = 0. Then, the Bianchi identity for F(5) takes the form 

dF(5) = A F(3) . (7.18) 

(Because of the self-duality, the second Bianchi identity d * F(5) gives the same equation.) If H3 
and F^^^ obey Eq. (7.15), we see immediately from Eq. (7.18) that di^(5) must be proportional 
to the volume of the internal manifold. 

Below wc will study the other R-R and NS-NS gauge potentials using their Bianchi identities. 
As a last step for now, we content ourselves with writing down the ten-dimensional Einstein 
equations in the bulk (i.e. without localized sources such as D-branes) [169, 170]: 

Rmn = \ 9mn dk^ - 2V„jV„$ - | Qmn 

+\ e'" F^(4)i^('^) + \ e'" G^J'^ Gl^^^ - ^ e^* g^n G(3) G<^^ (7.19) 

Note that only the last four indices are bound up in the square of the five-form flux, and 
that there is a symmetrisation with respect to the first index in the first term involving the 
complexified three form flux G(3) = ^(3) —rH^^y On the right hand side above, Eqs. (7.12) and 
(7.13) have been used to write the first line in terms of the dilaton only. Therefore we see that, 
for a constant dilaton, the first line in Eq. (7.19) vanishes and only fluxes from the C(4),C(2) 



98 



7.3. Ten-dimensional Solutions of Type IIB Supergravity 



and 5(2) gauge potentials appear as sources. 

Recall that for a metric ansatz of the form (6.6), the Ricci tensor will cleanly split into a four- 
dimensional and a six-dimensional piece, and that we argued that fluxes can only extend within 
the dimensions of the internal manifold. We can now appreciate the property of Ricci flatness 
{Rmn = 0) of CY manifolds: they are a solution for the internal compact piece of Eq. (7.19) 
in the absence of fluxes, i.e. when the right hand side vanishes. We now turn to more general 
compactifications of type (6.7), which allow a warp factor between M4 and Me. 

7.3.2. The No Go Theorem 

In a first step, we show that at leading order in the supergravity approximation of Section 5.3, 
any warp factor introduced into the metric via an ansatz like Eq. (6.7) must be trivial in the 

absence of localized sources for the fluxes [171, 172, 173]. The trace of the Einstein equations 
(7.19), calculated with the self-dual ansatz (7.17) for the five-form flux and the Minkowski metric 
iji^i, in extended spacetime, gives 

Aed/i-' = ^ e*|G3|2 + h {\daf + \dh-'f) , (7.20) 

where Ag^ here is the Laplacian on the internal manifold. If we were to integrate this equation 
over the internal compact space Me, the left hand side would vanish because it is a total deriva- 
tive. However, the right hand side is a sum of positive-definite terns, which only vanishes if each 
of the individual terms vanishes. Then, the three-form flux G3 must be zero, and the functions 
h{y),a{y) constant, i.e. trivial. This "no go theorem" for a non-trivial warp factor and non-zero 
fluxes was shown here for a Minkowski metric in four dimensions, but it can be generalized to 
anti-de Sitter and de Sitter spacetimes (to which we shall return in a moment). 
It is, however, possible to avoid the above no go theorem both by going beyond leading order in 
the supergravity approximation and by introducing sources (and possibly singularities) into the 
bulk background geometry [174, 168]. Wc already know that D-brancs carry charges under the 
R-R gauge fields, and we now show that they invalidate the no go theorem in another generaliza- 
tion from electromagnetism: the total charge enclosed in a compact space must vanish, because 
lines of field strength either have to go to sources, or to infinity, and the latter is impossible 
if the compactification volume is finite. In a compact space, there must therefore be an equal 
number of sources of opposite charge. 



D-brane sources 

If there arc localized sources present in the background, the contribiite a term on the right 
hand side of Eq. (7.20) which is cx /i^^^^Tioc, where Tioc denotes the trace of the Einstein tensor 
describing the localized sources. We discussed the Chern Simons and Dirac Born Infeld world 
volume actions for Dp branes in Section 6.1, and we also mentioned the possibility that branes 
can be "wrapped" along some of the internal directions of the ten-dimensional spacetime. (In 
type IIB, for example, this will be the case if the number of spatial directions p > 3.) We 
therefore write the action of a Dp-brane wrapping a (p — 3)-cycle S (at leading order and for 
vanishing fluxes on the brane) as 

= - / dP+'i TpV^ + /xp / C(p+i) . (7.21) 
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Our main interest is in branes with p = 3, since they can couple to the gauge potential C(4) 
behind the F(5) flux. Note, however, that to the gauge invariant flux -^(5), also the i^(3) and F(3) 
fluxes contribute, see Eq. (5.58). As we mentioned in Section 6.1.2 and will become important 
below, branes of higher p can also carry charges of the D3-brane type if they are wrapped (so- 
called fractional D3 charges) . In type IIB theory we should gcncrically also expect contributions 
from D5-branes wrapping two-cycles, and D7-branes wrapping four-cycles. 
Apart from an additional term on the right hand side of the Einstein equations, sources also 
change the Bianchi identity (7.18) for as 

dFs = A F3 + 2kio T3 pf , (7.22) 

where we have specialized to D3-brane (type) charges with tension T3 and density p^°^^ ■ The 
charge density pf°'^^ then typically contains (5-functions that specify the location of the D-branes 
within the compact dimensions. If this equation is integrated over the internal space Me, then 
it leads to the type IIB "tadpole cancellation condition" : 

^/^_^,„A% + g3 = (7.23) 

Here, Q3 is the total charge obtained from integrating over Mg. It can be shown that the 
requirement for G(3) to be imaginary self-dual means that fluxes H^^^ and F(3) are only induced if 
Q3 is negative. Recall that negative charges can be provided by anti-D3 branes or 03 orientifold 
planes. 

The condition (7.23) can also be rewritten in a purely geometric way if type IIB theory is 
"lifted" to its so-called "F-theory" description, where it is compactified on an elliptically fibered 
CY four-fold X. (The base of that fibration then is the original type IIB, and the fibration 
describes the running of the axion-dilaton.) Then, the tadpole condition becomes 



/ %)AF(3), (7.24) 



24 2ACior3 Jm, 

where x is the Euler characteristic of X, and N^s is the D3 brane charge present in the compact- 
ification. (The left hand side of this equation can be interpreted as the negative of the D3-brane 
charge induced by the curvature of wrapped D7-branes, which we mentioned earlier.) 



Fluxes from D-branes 



We can now derive the form of the fluxes sourced by D3-branes: these branes couple to the 
C(4) potential, therefore we expect a non-zero ^^(5). Let us set the other fluxes ^(3) and _ff(3) to 
zero for the moment. (This is possible while there are only "proper" D3-branes, but needs to 
be revisited if there are fractional D3-charges in the background such as wrapped D5-branes.) 
Under these assumptions, insert the self-dual ansatz for the five-form flux (7.17) into the Bianchi 
identity (7.18) and subtract the result from the trace of the Einstein equation (7.20), now taking 
into account the contribution of localized branes. This gives the constraint 

A{h-' - «) = ^ h-''\iGs - *Gsf + h\dih~^ - a)f + 2kw h'^/^ (tIoc - Tspf . (7.25) 
The solutions to this system of equations are then characterized by 

*G3 = iG3, h-\y) = a{y), TToc = Tspf (7.26) 
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where the last equation means that the sources involved should exactly saturate the "BPS-like" 
bound TIoc > ^apg"^'*. (This bound describes a general property of tension and charge for 
localized sources in string theory.) The BPS-like bound is exactly satisfied by D3-branes, and it 
is satisfied by anti-D3 branes (but not saturated, given that their charge is negative). D7-branes 
wrapped on four-cycles and 03 planes (other possible sources for D3-type charge) can saturate 
it, while D5-branes on wrapped two-cycles satisfy, but do not saturate the bound. 

The property of imaginary self-duality for the complexified fiux G(3) is the key to determin- 
ing the missing superpotential W we need for a complete description of the four-dimensional 
supersymmetric theory. We now show how this along with the form of the warp factor h{y) 
can be found in a concrete model: starting from the intuition of the so-called AdS/CFT cor- 
respondence, we consider a stack of N D3-branes embedded into a ten-dimensional type IIB 
supergravity background and study the spacetime geometry in its vicinity. 
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From AdS/CFT to Fluxes on the Conifold 



We saw how D-branes as sources of R-R flux can help avoid the no-go theorem for 
warped com,pactifications, and we now explain how this is related to the AdS/CFT 
correspondence and its generalizations. Of particular interest are compactifications 
on so-called conifolds, in which the fluxes backreact on the geometry to deform it in 
a singularity-avoiding way. On these backgrounds, all complex structure moduli can 
be stabilized while still preserving supersymmetry in four dimensions. 

8.1. The AdS/CFT Correspondence 

In Section 6.2.1, we split the ten-dimensional metric into a four-dimensional external and a 
six-dimensional internal piece [see Eq. (6.6)], using the Minkowski metric rj^i, for the former. 

However, the requirement of maximal symmetry (Lorentz invariance) in four dimensions could, 
apart from Minkowski space, also be met by a de Sitter (dS) or an anti-de Sitter space (AdS), 
ie. four-dimensional spacetimes with positive or negative cosmological constant, respectively. 
De Sitter geometry is what one may hope for in a phenomenologically successful construction 
(given that a small but non-zero positive cosmological constant is observed in our Universe), 
but it has proven notoriously difficult to obtain in string theory. For AdS space, however, a 
remarkable relation with conformal field theory (CFT) on p-branes has been found, which is 
referred to as the "AdS/CFT correspondence". The case important to us concerns type IIB 
superstring theory with D3-branes embedded in the ten-dimensional background. 
Consider A'^ coincident D3-branes in a higher-dimensional spacetime with structure AdS^ x 
5^, i.e. five dimensions compactified on a sphere, plus five-dimensional anti-de Sitter spacetime. 
As mentioned before, open strings with both ends attached to a Dp-brane give rise to gauge 
theories living on the brane world volume. In the case of a stack of N Dp-branes, the gauge 
theory can be shown to be maximally supersymmetric SU{N) Yang Mills^. The AdS/CFT 
correspondence then states that the low-energy world volume theory of these branes is dual to 
the string theory in the near-horizon (close to the branes) geometry of the bulk, in the sense 
that the two theories describe the same physics in different limits of "coupling" : when the gauge 
theory living on the branes is weakly Yang Mills coupled, the background geometry is strongly 
curved and vice versa. 

Dp-branes have tension, i.e. mass, and they carry charges under the C(J,_^.l^ gauge fields, which 
means that they can source fluxes and curvature in ten dimensions. Such backgrounds then 
have the "warped" form of Eq. (6.7), and we discussed them in Section 7.3. For our stack of 
D3-branes, sourcing the four-form potential C(4), N units of five-form fiux are threading the 
internal (recall that fiuxes cannot have legs in the extended dimensions). Again, this can 
be interpreted as a generalization of Gauss' law: the N D3-brancs (each of which carries unit 
charge) are inside the compact S^, where they can be moved around at random since they do 

^To be precise, in the absence of background fields and at lowest order in a' , the low-energy effective action on 
the brane is obtained from dimensional reduction of supersymmetric SU{N) gauge theory in ten dimensions 
down to the (p -|- 1) dimensional world volume. 
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not experience a force in this background (their R-R interaction and gravity cancel). In this 
sense, their charge is "smeared out" over the entire S^, and they are not properly "localized". 
Let us now calculate the warp factor in this setup. 



8.1.1. Warp Factor in an AdS5 x Background 

The ten-dimensional ansatz for the metric in an AdSs x background [compare Eq. (6.7)] is 

1 



ds 



lOd 



(8.1) 



where d^l^^ is the angular metric on the five-sphere, and we take the warp factor /i to be a 
function of the "radial" coordinate r only, independent of the angular position on the sphere. 
Because of the large amount of symmetry both in the internal and the external space, there are 
only two independent non-zero components of the ten-dimensional Ricci tensor. 
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and the Ricci scalar is found to read 
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If we add Eqs. (8.2) and (8.3), and consequently their respective right hand sides in the Einstein 
equations (7.19), we can eliminate the oc [dh/dr)"^ terms, while the right hand (source) sides 
exactly cancel. (This is true in general for the "bulk" terms, without assumptions about the 
dilaton or fluxes; since we do not consider the N D3-branes as localized sources, we did not 
introduce an energy momentum tensor Tmrf^ for them.) We obtain for h{r) that 
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0, 



(8.5) 



which is exactly the combination appearing in the Ricci scalar (8.4), hence our chosen internal 
manifold of a basis and a r radial coordinate is still Ricci flat. We can now exploit Eq. (7.20), 
which was obtained from the trace of the Einstein equations and the five-form self-dual ansatz 
(7.17). With the form of the Laplacian on the five-sphere, and for vanishing three-form fluxes, 
we find by combination with Eq. (8.5) that 
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(8.6) 



This is solved by setting a{r) = h~^{r), as it was anticipated in Eq. (7.26). [Note, however, that 
we have worked with Eq. (8.5), arguing that the D3-branes are not localized, but their charge 
smeared out, causing the F(5) flux.] Finally, the solution of Eq. (8.5) takes the form 



(8.7) 
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where Ci^2 are integration constants to be fixed by boundary conditions and by integrating 
the five-form flux over the sphere, which should give the total charge N, J^g F(5) oc A^, 
where the ansatz (7.17) with a(r) = h~^{r) should be used. Therefore, we see that the flux, 
in a generalization of the Dirac condition, is quantized. In particular, it can be tuned only in 
discrete units. 

Wc can think of the radial coordinate r roughly as the distance to the stack of D3-branes. The 
constant term Ci leads to a plateau at large r (i.e. far away from the stack of branes) where 
the second term dies out. In this limit where r 3> C2/C1, the warped "corner" of the overall 
CY should join into the bulk. Here, we are interested in the other, "near-horizon limit" of small 
r ^ C2/C1, where the metric takes the form^ 

ds?od - ^ ri^.Ax" dx" + ^ dr^ + R^d^l , (8.8) 

where the are the coordinates on the parallel D3-branes (which are aligned with the extended 
spacetime dimensions). R is the "radius of horizon", with = AirgsNa''^, and therefore C2 = -R^ 
in the notation of Eq. (8.7). The scale R is the characteristic curvature scale both for the AdSs 
and the five-sphere. On the brancs, the AdSs x corresponds to a = 4, d = 4 super 
Yang Mills theory with gauge group SU{N). We have J\f = 4 (instead of A/" = 8) because the 
presence of the branes breaks half of the supersymmetry. The SU (N) on the branes is UV finite 
(corresponding to large values of the r coordinate where the warp factor becomes constant), 
and conformally invariant. In the IR limit for r ^ 0, the warp factor seems to run into a 
singularity. We shall see shortly how this can be avoided if corrections to the geometry are 
properly taken into account. The Yang Mills coupling constant g of the gauge theory is related 
to the string coupling by g^ = ^Trgs, which illustrates the "duality" character of the AdS/CFT 
correspondence: since R^ oc g"^, the supergravity theory is strongly curved (small characteristic 
curvature R) when the gauge theory is weakly coupled. 

From Eq. (8.8), we can also make explicit the equivalence of warped backgrounds to Randall 
Sundrum models that was mentioned earlier [9]: if we re-define the r variable as r = ro e^/^, the 
ten-dimensional metric using ^ becomes 

dslod - Vixv dx^ dx^ + d^2 + R^dSll , (8.9) 

where the constant tq has been absorbed into the four-dimensional coordinates. The new variable 
^ plays the role of proper distance along the throat. Setting aside the dri| part, this metric is 
five-dimensional dc Sitter space, and we see that the warp factor varies exponentially quickly, 
h oc exp(— 4^/i?), in terms of proper distance. 

8.1.2. Warping on Einstein Spaces 

The five-sphere is an exceedingly simple choice for a compact space in string theory. One can 
generalize Eq. (8.8) as 

ds^ = /t-V2 ^^v ^ /jV2((i^2 ^ ^2^^2^ ^ jQ^ 

where ds| now stands for a so-called Einstein space X5, which is characterized by a Ricci tensor 
Rmn gmn (with Constant proportionality factor). Comparing with the ansatz of Eq. (6.7) for 
a general warped spacetime metric, we see that the six compact dimensions form a cone, with 

^This is obtained by treating the D3's as "black branes", the string theoretic analogues of black holes [175]. 
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radial direction r and the Einstein space as its basis. (We have written the warp factor with a 
subscript "3" to make it exphcit that we are working in the horizon hmit of a stack of "proper" 
D3-branes only so far.) For any ds5 7^ dOs, there is a singularity at the "tip" of the cone where 
r — 7- 0. The N D3-branes then are are localized at this singularity, instead of being "smeared 
out" over the in the non-singular case. 

The is referred a Sasaki Einstein space if it is an Einstein space, and if together with the 
radial direction r in Eq. (8.10) it gives a non-compact Calabi Yau (note that there is a singularity 
at the tip, therefore this is not really an manifold). CY spaces break 3/4 of supersymmetry, 
therefore the dual gauge theory SU (N) on the world volume now should have M = I (instead 
of = 4 for the S^). The formula for the curvature radius [provided one uses coordinates on 
the such that Rm'n' = 9m'n' {n^' J n' = 5, . . . ,9)] is 

i?^ = 47r5s a'^ ^ , v = Vol(X5)/Vol(5^) . (8.11) 

The parameter v is a convenient dimensionless measure for the volume of the cone basis relative 
to a five-sphere. There is an infinite family of choices of spaces X5, but the simplest non-trivial 
example is T^'^, which is SU{2) x SU{2)/U{1) and has the topology of x S^. (Hence, this is 
the topology of the cone at its basis.) Together with the coordinate r, the space T^'^ is called 
the "simple conifold". 

The X 5*^ basis beautifully illustrates the importance of geometry in string theory: we men- 
tioned earlier that Dp branes can wrap cycles if some of their p spatial directions lie along the 
compact coordinates. Therefore, the general type JIB background with D3-branes [which create 
the warped geometry Eq. (8.10) with ds^ as the Sasaki Einstein space T^'^] can, for example, 
also contain D5-branes wrapped on a two-cycle in the basis of the conifold. If there are M 
D5-branes wrapped, they provide M quantities of (fractional) D3-brane charge on top of the N 
"proper" D3-branes already in the geometry. However, at the tip where r = 0, this two-cycle 
shrinks to a point, making the conifold singularity manifest. 

In geometrical terms, one can think of two ways to avoid the singular behavior for r — )• 0: either 
the or can stay finite at the tip. These are called the "deformed" and the "resolved 
conifold" , respectively, and we now turn to compactifications on the three varieties of conifolds. 
Keep in mind though that "compactification" in this context is somewhat a misnomer, since the 
conifold geometries are non-compact: they are usually "cut off" at the characteristic radius R, 
where there are glued to CY bulk manifold, which, in turn, is compact. Deviations from the 
known conifold geometry can therefore occur in the UV end of the theory where the unknown 
bulk takes over. 



8.2. Conifold Compactifications 



Inspired by the AdS/CFT duality, a series of papers [174, 176, 177, 178, 179] presented solutions 
to the equations of motion of type IIB superstring theory in a background with N D3-branes and 
M fractional D3-branes. Wc now follow the line of development in these papers: the strategy is 
to place the regular D3-branes at the singularity in the AdS^ x X^ (where they act as localized 
sources of curvature and flux), and consider the M fractional branes as a perturbation. In terms 
of the dual gauge theory description, the stack of N branes would originally (by itself) have a 
conformally invariant SU{N) with J\f = 1 supersymmetry, but the fractional branes break the 
conformal invariance and lead to a renormalization group (RG) flow. With guidance from the 
behavior of this RG flow, one can draw conclusions for the effect of the fractional D3-branes on 
the dual string geometry. 
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8.2.1. The Simple Conifold 

We saw in Section 8.1.2 how a stack of N D3-branes gives rise to a conical background geometry 
with a characteristic radius R, the prime example being the simple conifold. A convenient 
way to describe the six-dimensional conifold geometry is in terms of four complex variables Wa 
restricted by the equation [180] 

4 

J2^l = 0. (8.12) 

a=l 

Note that this corresponds to two conditions, because the Wa are complex, i.e. Wa = w^a^ +iw^a'^ . 
[This component notation will be used shortly to define cycles within the conifold described by 
Eq. (8.12).] The basis of the conifold is the Einstein space T^'^, whose metric reads 

using five angular coordinates (^, ^i, 0i, ^2, ^2)- As we stated earlier, this is topologically a 

product of spheres 5^ x S'^ . Note the symmetry between the to pairs of angular coordinates 
((/>!, 61) and ((/>2, 6*2). For later use, let us define a basis of one-forms 5*, i = 1, . . . , 5, in terms of 
which the T^'^ Einstein space metric (8.13) reads 




ds^ 



TM = ^(.^f + ^E(yT. (8.14) 



1=1 



These one-forms are defined from 



where the e', z = 1, . . . , 5 stand for 

= — sm9id(f)i , = d^i , e'^ = cos sin ^?2 d02 — sin ■;/; d02 , (8-17) 

= sin sin 62 d^2 + cos ijj (102 , e'^ = dijj + cos 9i dcpi + cos $2 d(p2 ■ (8.18) 

It can be shown that the components of the Ricci tensor Rmn as well as the Ricci scalar cal- 
culated with the metric (8.13) arc exactly the same as those for the five-sphere we obtained in 
Eqs. (8.2), (8.2) and (8.4). However, given that the conifold, in contrast to the five-sphere, has 
a singularity where the stack of D3-branes is localized, there is now an additional source term 
Tmn^'^ (containing a (^-function) on the right hand side of the Einstein equations. We now show 
how this singularity can be avoided by an appropriate "deformation" of the simple conifold. 



8.2.2. The Deformed Conifold 



As announced above, we now add M fractional D3-branes (i.e. D5-branes wrapped on a two- 
cycle) to the type IIB supergravity background. It can be shown that in terms of the Yang Mills 
gauge theory living on the stack of N D3-branes, the SU{N) changes to SU{N + M) x SU{N) 
(with Yang Mills gauge couplings gi and g2), and while this theory is still supersymmetric, the 
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conformal invariance is now broken. This means that it undergoes a renormahzation group flow, 
the direction of which lies along the r-direction of the cone geometry. 

Using the so-called Seiberg duality, it can be shown that the gauge theory confines deep in 
the IR, i.e. in the limit r — t- 0. However, we saw that the supergravity theory compactificd 
on the simple conifold had a singularity in its IR limit, where the warp factor h oc blows 
up. However, the gauge theory and the supergravity geometric background are related by the 
generalized AdS/CFT correspondence - how can these different types of behavior be reconciled? 
In [176, 177, 178] both the supergravity background and the rcnormalization group flow of the 
SU{N + M) X SU (N) theory were studied closely together: in this way, physical intuition as 
well as calculational results obtained in one picture may be transferred and reinterpreted in the 
dual description. We now highlight the basic steps of this development. 



Fluxes from wrapped branes 



B(2) 



The M fractional D5-branes give rise to an R-R three-form flux contribution through a three- 
cycle of the T^'^, which is quantized as Jga F(3) oc M. In this sense, the wrapped D5-branes 
arc coupled to the C(2) R-R gauge potential. However, this R-R potential has a two-form NS-NS 
twin, namely -6(2)) with which it is grouped together in the vector !B(2) used in the 5'L(2,M) 
invariant notation of the type IIB supergravity action in Eq. (5.60). If there is an F^^) Aux, 
there must hence also be a non-zero NS-NS potential B(2) in the background, for which one can 
make the ansatz 

*/(r)a;(2), / a;(2) = 1 , (8.19) 

where W(2) is a two- form of the background we specify below. The second expression in Eq. (8.19) 
restricts this two-form, and is the cycle dual to on the (five-dimensional) T^'^ basis. (These 
will be made explicit in Chapter 11.) 

We focus on the case of constant axion-dilaton r, which from Eq. (7.14) tells us that G(3)G'-^^ = 0. 
The Bianchi identities for the imaginary self-dual three-form G(3) then read 

d * G(3) = zF(5) A G(3) , dG(3)=0. (8.20) 



The flux F(3^ created by the M fractional branes should be proportional to the closed three-form 

^(3) = dq2) 



on the T^'^ (so that dF(3) = dC(2) = is respected) 



F(3) ocMe^ A (^e^i A e'^^ - e^^ a e*^^ , (8.21) 



where the new one-form basis e* here is 

2 \ 



gv- = ^ i^dV' + ^ cos dcp.j , = ^ ' ^"^^ " 7! ■ ^^-^^^ 

The two-form oj^^) defined by the second equation in Eq. (8.19) in terms of this basis reads 

W(2) = (e^i A e-^i - a e'^'^^ . (8.23) 

The dilaton $ is constant, therefore it follows from the ansatz (8.19) that the NS-NS flux is 
given by ii"(3) oc d/(r) A (e^^ A e'^'^ — e^^ a 6"^^^ -yy^g now also set the Co scalar to zero; note that 
the constancy of r then imposes the conditions (7.15) and (7.16). 
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Because F(5) is self-dual, it has to be proportional to the sum of the volumes {i.e. the five-forms) 
on the AdS space and the compact volume. [Earlier, we came to the same conclusion using the 
Bianchi identity (7.18).] With the ansatz (7.17) used earlier, this was automatically respected 

by using the prcfactor (1 + *). As a consequence, F^^^ A i?(3) = 0, which wc can use in the first 
Bianchi identity in Eq. (8.20). It then follows from (the imaginary part of) this expression that 
the function /(r) in Eq. (8.19) obeys 

cxM, /(r)cxMlogr. (8.24) 

Therefore, the components of 5(2) grow logarithmically with r, hence its derivative il(3) oc r~^, 
which diverges in the IR for r ^ 0. Again, there is an "electromagnetic" intuition which makes 

this behavior plausible: like i^(3), the /f(3-) flux threads a three-cycle in T^'^, and towards the 
"tip" of the conifold, this three-cycle shrinks to zero, resulting in the divergence. 

Let us briefly comment on the order to which these solutions are valid: if M is kept flxed, and 
N goes to infinity, then the backreaction of the fluxes on the metric can be ignored to leading 
order in N. At first order in M/N, we therefore identified the AdSg x T^'^ with three-form 
fluxes turned on as the supergravity dual of the gauge theory SU{N + M) x SU (N) (which 
lives on a stack of N D3-branes in the presence of M wrapped D5-branes). [The dilaton only 
vanes at order 0{M'^/N^).] At larg e r (i.e. in the UV limit), the renormalization group flow 
of the dual gauge theory is correctly reproduced, but not in the IR for r ^ 0. On the other 
hand, in the gauge theory description the behavior of the RG flow is known at all scales, and 
in particular, there is confinement in the far IR. This can be put to use to determine the non- 
singular supergravity background which replaces the simple conifold geometry in the r — >■ 
limit, removing e.g. the divergence in the H(3) fiux. 



Following the RG flow 

The direction r lies along the RG fiow and what is more, the function /(r) calculated in Eq. (8.24) 
denotes the supergravity dual of the scale dependence for the Yang Mills couplings gi and 52 • 
Along the RG fiow, M = 1 supersymmetry of the gauge theory is preserved, which should lead 
to a corresponding feature in the dual supergravity description in ten dimensions. To obtain 
solutions of the full type IIB equations of motion at all orders, it seems promising to start from 
an ansatz for the metric that keeps the "AdS -|- compact space" splitting as well as the structure 
of the T^'^ at the basis of the cone, but allows for deformations. 



AdS "T 

2 



(8.25) 



dsl = ^ e'^ ( + ^ cos ei dcf>A + ^ e^^ ^ (d^^ ^ ^-^2 g, ^^2^ 26) 

\ i=l I i=\ 

The functions C depend on the internal coordinates only, and the conformal prefactor in front 
of the AdS piece is chosen such that after compactification, the Einstein frame is preserved. R is 
the characteristic scale of both the AdS and the compact space, as we saw earlier. Note that in 
Eq. (8.26), the symmetry between {(pi,6i) and {4>2,d2) (ie. between the two two-spheres inside 
the T^'^, of which one is then fibered over the ^a) is still preserved. Let the metric in the AdS 
piece have the form 

dsids = dii^ + e^^^^^dx^dx'^ . (8.27) 

(Note that u is not identical to the radial coordinate r used before.) This ansatz for the metric is 
supplemented by corresponding expressions for the two-form B(2) and the fluxes. As an example. 
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let us cite the self-dual five-form, which is written as F(5) = J^(5) -|- *^(5) (still using built-in 
self-duality), and given that it must be proportional to the volume forms as before, we have 

J-(5) = K{u) A A e^' A e^^ a e"^^ , (8.28) 
= e^^-^^/^'^^^+'^^'> K{u)duAdx^ Adx"^ Adx^ Adx\ (8.29) 

The Bianchi identities and the equations of motion obtained from the type IIB supergravity 

action then lead to a system of coupled differential equations of second order (in the coordinate 
u) for the functions A, B, C, K in Eqs. (8.26), (8.27) and (8.28), as well as the functions used to 
parametrize the other fields. 

How can one hope to find solutions for all of these functions at once? In [177, 178] the gauge 
theory-inspired fact that the solutions must preserve supersymmetry in a certain sense was used 
to reduce the system of equations from second to first order in u-derivatives. One remarkable 
feature of the resulting solutions is that the dilaton $ is constant at all orders in this background. 
Another other striking fact is that, under the influence of the IR confinement of the dual gauge 
theory, the conifold geometry undergoes a deformation, which we now describe. 



Deformation of the simple conifold 

The simple conifold as defined by Eq. (8.12) had a singularity at the point where all Wa = 0. In 
consequence of the additional three-form flux caused by the M fractional D5-branes, the conifold 
is deformed such that now 

4 

Y,wl = e\ (8.30) 

a=l 

where is a dimensionful parameter. In terms of a conveniently defined "radial" coordinate r, 
the metric on the deformed conifold is written as 

<^sl = r"'^{r)[-^^^[dr^ + {9'f] (8.31) 

+ cosh2 (0 [{g^f + {g'f] + sinh^ (I) [{g^f + {g^f 
where we have used the 5* basis defined in Eqs. (8.15) and (8.16) and the function K{t) reads 

^ ' 2V3sinhT ^ ' 

Note that at large r, we expect to recover the simple conifold because far from the tip of the 
cone the deformation should not play a role. Indeed, in this limit one can get back to the usual 
form of the T^'^ in Eq. (8.13) by setting ~ e^e^. Let us now turn to the other limit, ie. the 
tip of the cone. At r = 0, the metric (8.31) degenerates into 



d^l = ^ (2/3)V3 



(8.33) 



Using the expressions for the g^, one can check that this is the metric of a three-sphere S^. It 
is in this three-sphere that the -F(3) flux due to the wrapped D5-branes lies at the apex of the 
cone, therefore we now know that the correct integral is J^s ^(3) oc M. As we did earlier for 
the N D3-branes in the AdSs x construction, one may think of the M wrapped D5 branes 
"smeared out" over the of finite size at the tip of the cone. 
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The remaining two directions in Eq. (8.31), i.e. the 5^ fibered over the S^, vanish quadraticaUy 
as (1/8) 6^^/^(2/3) ^/^r^ [(5^)^ + (S'^)^] r^. A convenient way to fix the parameter e is to 
choose it such that the prefactor in this expression becomes 1, therefore e = 12^/^. Note that 
the symmetry of between the two two-spheres in the basis of the conifold [which went into the 
ansatz (8.26)] is still preserved. An illustration of the singular and the deformed conifold is 
provided by Fig. 8.1. 



Corrections to the warp factor 



Wc know that in the UV limit, the deformed conifold smoothly turns into its simple cousin. 
Recall that the stack of N coincident D3-branes is present in both cases (i.e. non-zero -F(5) flux 
sourced by the D3s), but the deformed conifold background on top has non-zero three- form 
fluxes because it contains M D5-branes wrapped on a two-cycle. On the simple conifold, the 
warp factor h{r) has the form (8.7). If wc keep r as a radial coordinate (instead of replacing it 
by r as required in the apex of the deformed conifold), we can ask whether there still exists a 
function h{r) which allows us to write the ten-dimensional metric as 



ds 



2 

10 



h~^^^{r) ??^z. dx" dx" + h^/^{r) (dr'^ + d4i,i) , (8.34) 



where h(r) defers from the simple warp factor h{r) by a correction due to the M D5-branes. 
Indeed it can be shown [178] that Eq. (8.7) in the presence of three-form fluxes is replaced by 
(see [181] for a review) 



h{r) = Ci + 



Att a' 



12 



gsN + a(5s My In — + a 



21 



ro 4 



(8.35) 



where a is a constant of order 1, and Ci again is the value of the plateau at large r. (One 
often sets Ci = if the focus is on the near- horizon limit close to the stack of D3-branes only.) 
The constant radius ro now corresponds to the IR end of the geometry, which is finite (instead 
of r = for the simple conifold). Remarkably, the deformation of the conifold by the three- 
form flux is modeled by a logarithmic correction to the D3-brane charge, i.e.N is replaced by 
N + agsM"^ ln(r/ro) + a (ggM'^/A), and one may speak of an "effective charge" depending on 
the position in the radial direction. The deformed conifold is often referred to as the Klebanov 
Strassler (KS) throat, and it is a crucial ingredient of the string cosmological scenarios we study 
in the following because all complex structure moduli are stabilized in this setup. 



8.2.3. The Resolved Conifold 



For completeness, let us briefly mention that the conifold family has a third member, the so- 
called resolved conifold. Geometrically, it defers from the deformed conifold in so far as now the 
S"^ (and not the S^) stays finite at the tip of the cone. The approach to solving the supergravity 
equations of motion on this background is the same as before, but now instead of the ansatz 
(8.26), the resolved conifold is ansatz is written such that it, while keeping T^'-*" structure, it 
allows for an asymmetry between the two S"^ contained in the basis [179]. One may again exploit 
the preservation of supersymmetry to reduce the second order system of coupled equations to 
a first order one, which may be solved for all fluxes, the metric and the dilaton (which is 
still constant). The metric of the resolved conifold is exactly known and in suitably chosen 
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Candelas & de la Ossa (1990) 
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X 

Klebanov Si Strassler (2000) 




^^^^ ^^^^^ 



tip 

•X 



Figure 8.1.: Left: The simple conifold of Section 8.2.1: over a basis of spheres S*"^ x 5*^, the radial 
coordinate r varies from its maximum value r^^ to r = at the tip. The geometry is 
singular at the tip because both spheres shrink to zero size at r = 0. Right: The deformed 
conifold of Section 8.2.2: The basis still consists in the product x S"^, but the radial 
coordinate can now only take values in the range from to a finite rg at the tip. The 
three-sphere stays finite at the tip, while the S'^ still shrinks to a point. The deformation is 
possible because the is stabilized by the three-fluxes of non-vanishing background fields 
threading it. 



coordinates (with a radial direction p) it reads 



ds 



iJ— d/9^ + ^ k{p) p^ (dV" + cos 01 d(/)i -I- cos 02 d(/)2)^ 
k{p) 9 



where the function k{p) is given by 



Hp) 



p^ + 9a2 
p^ + Qa? 



(8.36) 



^.37) 



The parameter a is both a measure for the "asymmetry" between the two pairs of angles (^i, (pi) 
and {02,<p2) and a singularity resolution parameter, as one can see from Eq. (8.36) in the IR 
limit (i.e. for p — t- close to the tip). 



[d, 



2 + 1 p2 (^2 ^ g2^ + e2 J + (a^ + ^ p2 



(8.38) 



The second term on the right hand side shows that this time the shrinks to zero size at the 
apex. As it was the case for the deformed conifold, in the limit of large p the simple conifold 
metric (8.13) is recovered. 

In Section 6.3 we mentioned that the three distinct geometries describing the simple, deformed 
and resolved conifold are, in the exact string theory description beyond the leading supergravity 
order, related by so-called geometric transitions. Reviews can be found in [182, 183], and the 
general idea is illustrated in Fig. 8.2. 



At the end of this rather technical Chapter, we pause to retain the key lessons learned from 
flux compactifications and the AdS/CFT correspondence. We saw that, respecting certain 
conditions resulting from the cancellation of tadpoles, one may embed Dp-branes of suitable 
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Figure 8.2.: The "confold transition" relates the three geometries discussed in Section 8.2. All 
three conifolds have a product x S"^ at their basis, but they differ in the limit 
r — 7- towards the tip. In the deformed conifold (left), the three-sphere of the basis 
stays of finite size, while the two-sphere shrinks to zero. The can be shrunk to 
zero to obtain the simple conifold (center) with a singularity at its tip. From this, 
the 5^ then is blown up to reach the resolved conifold (right). 



dimensionality into a the supergravity background. In the case of type IIB theory, these branes 
must have an odd number of spatial dimensions. They then act as "pointlike" sources for the 
R-R (and NS-NS) gauge potentials with an even number (p + 1) of indices, giving rise to (p + 2) 
fluxes. Branes wrapped along compact dimensions can provide "fractional" charges of lower p 
type. If fluxes run through cycles of the compactiflcation manifold, the shape of these cycles 
can no longer be changed at will (because it comes with an energy cost), and therefore the 
corresponding geometric modulus (of complex structure type) is stabilized. Moreover, these 
flux-enriched geometries typically have a warp factor h(r), which means that energy scales such 
as e.g. the tension of a brane depend on the position r. Expressed in the "proper" coordinate 
^, which measures distances inside the warped geometry, this dependence can be exponentially 
strong. The prime example of such a flux compactiflcation is the Klebanov Strassler throat, also 
called the deformed conifold. In the UV limit, it resembles the simple conifold, but unlike the 
latter remains flnite in the IR. In terms of the naive radial coordinate r, the warp factor in the 
KS throat receives logarithmic corrections, and moreover r runs over a flnite range of values, 
namely from the bottom of the throat ro to its edge ryv = R- Both of these are functions of the 
chosen flux quantum numbers, the string scale, and the coupling gg [see Eq. (8.11) and Chapter 
11]. In the next Chapter, we turn to the stabilization of the remaining part of moduli space, 
i.e. the Kahler structure moduli. 
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We saw that after compactification on a suitable Calabi Yau space with NS-NS 
and R-R fluxes turned on, all compex structure moduli are fixed. In this Chapter, 
we discuss how the "other half", i.e. the Kahler structure part of moduli space, can 
be stabilized, in particular the overall volume modulus: assuming it is the only field 
remaining massless in four dimensions, non-perturbative corrections to the superpo- 
tential are used to give it a (very heavy) mass. 

The decisive step towards fixing all moduli in the effective four- dimensional theory was presented 
in the work of Kachru, Kallosh, Linde and Trivedi [184] and is called the KKLT stabilization 
procedure. In this setup, the last massless Kahler modulus -the overall compactification volume- 
is stabilized by non-perturbative effects, leading to a supersymmetric anti-de Sitter minimum 
in four dimensions. Mildly breaking supersymmetry by adding anti-branes, this stable AdS 
minimum is then lifted to a metastable (but very long-lived) dS one. Let us now develop an 
intuition for this construction. 



9.1. Reminder: Supergravity Lagrangian 



For freqTient use in this and the following Chapter, we briefly recall the relevant expressions for 
the Kahler and the superpotential that describe a locally supersymmetric theory in four dimen- 
sions. These two functions completely determine the scalar part of a supergravity Lagrangian^, 
and while the superpotential W{(pm) is holomorphic, the Kahler potential (responsible for kinetic 
terms) is a general function of both (p^ and 4>rm -^(<^m) 4'rn.)- The Lagrangian is then written as 

= -K,j{<p, 4>) d^ct>'d^4^ - F((/), ^) , (9.1) 

where Kjj is the "Kahler metric" on field space, obtained from second derivatives of the Kahler 

potential, see Eq. (3.11). The F term in the scalar potential (where we have suppressed the 
corresponding subscript) can be obtained from the super- and Kahler potentials as [compare 

Eq. (3.9)] 

V = e^ {K'^ D^WDjW - 3\W\^) , DiW = W^i - K^^W . (9.2) 

We mentioned earlier that D^W is the order parameter for supersymmetry breaking, therefore, 
in a minimum of the potential Vp where in addition = 0, supersymmetry is preserved. 

Generally, the potential Vp is (by definition) precisely flat in the direction of moduli (p at tree 
order (as it was the case in "no scale" supergravity), i.e. these fields do not appear in Eq. (9.2). 
However, this situation is changed in the presence of the fluxes, which, as we argued before, result 

^Here we ignore the D term in the supergravity potential, and we do not consider the gauge kinetic function fab 
for vector multiplets. It arises if there are low-energy gauge fields present and coupled to the scalars. 
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in an energy cost for the changing the size and shape of geometric details in the background, 
while the same topology is still preserved. 

9.2. Kahler and Superpotential in Flux Compactifications 

In Section 7.1.2, we saw how to determine the two contributions K^'^ and K^'^ to the Kahler 
potential in the presence of background fluxes. The superpotential W can also be calculated 
from the flux compactification of (in our case) type IIB string theory based on arguments 
from the equations of motion and (imaginary) self-duality. Recall that one of the conditions 
characterizing flux compactifications with D-brane sources (see Section 7.3.2) was *G(3) = ^(^(s), 
where G(3) = i?(3) — tF(3) and r the axion-dilaton. We also know that both if(3) and F(3) are 
restricted to lie within the compact manifold Me, therefore the "*" operation actually acts on the 
six-dimensional internal piece of spacetime. Its imaginary self-duality under "^g" then restricts 
G(3) to have only pieces proportional to (2,1)- and (3,0)-forms on Me [174]. The constraint 
*G(3) = iG(3) can also be derived from the superpotential 

W = [ G(3)AJ^. (9.3) 

with $7 the (3,0)-form of the compact internal space. Let us now examine the conditions for 
supersymmetry to be preserved with this superpotential [151]. 

Suppose the CY manifold under scrutiny has (before fluxes are turned on) massless moduli fields 
corresponding to /i^'^ complex structure moduli (pa, the axion-dilaton r and a superfield called 
p which contains the (only) Kahler modulus. We then saw [compare Eq. (7.9)] that the Kahler 
potential for the cpa is 

K^'\(t>a) = -\og(i [ nAo) , (9.4) 

to which we add those for r and p, which are given by 

K(p) = -31og[-i(p-p)] , K{T) = -log[-iiT-f)] . (9.5) 

[Note the similarity with the no scale Kahler potentials of Eq. (3.10).] The total Kahler potential 
is given by the sum of the three terms in Eqs. (9.4) and (9.5), K = K'^'^ {(pa) + K (p) + K (t) . The 
order parameter for unbroken supersymmetry is DiW = diW + K^i W, where i runs over the (pa 
as well as r and p, and we need = to preserve supersymmetry. The remarkable property 
of p is that it does not appear in the superpotential, therefore DpW = 0, and more precisely 

DpW = dpKW = -(^-^^^W = 0, (9.6) 

therefore also W = Q holds for supersymmetric configurations. Above, using the argument 
of self-duality, we already restricted the possible pieces of G(3) to (2,1)- and (3,0)-forms. But 
from Eq. (9.3) we see that to have W = Q identically, G(3) can only be of (2,l)-form type (see 
e.g. [151] for a more detailed discussion). If, however, we had W = Wq with Wq a small constant 
calculated from the fluxes, this would tell us that i) supersymmetry is mildly broken and ii) 
that Wo must be due to a (3,0)-form contribution in W. This will be the case of interest in the 
following Section. Let us one add one more remark on the "no scale" form for of the Kahler 
potential for p observed in Eq. (9.5): this is a manifestation of the property that the three 
conditions characterizing the flux compactification solutions, i.e. the imaginary self-duality of 
G(3), the tadpole condition and the saturation of the EPS-like bound (relation between tension 
and charge) in Eq. (7.26), are all invariant under rescaling by a constant. 
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9.3. The KKLT Procedure 

The starting point for KKLT were warped type IIB compactifications of the Klebanov Strassler 
type which have non-trivial fluxes for their NS-NS and R-R potentials, see Section 8.2.2. Let 
the fluxes be of such form that all complex structure moduli are stabilized, but supersymmetry 
still preserved. Then the resulting four-dimensional supergravity theory is of the no scale type, 
and hence the overall volume modulus p remains unfixed. 

As before, it is assumed that p is the only unfixed Kahlcr modulus. [It is possible to construct 
explicit models with this property, and in terms of the Hodge numbers of the Calabi Yau it 
means that we have /i^'^(V) = 1.] One then includes corrections which violate the no scale 
structure, of which non-perturbative corrections to the superpotential are one example. Recall 
that the Kahler potential for p is Eq. (9.5), while its superpotential is constant, W = Wq. [As 
stated above, this arises from the (3,0)-part of the imaginary self-dual three-form fiux G(^y 
Note that with the corrections we are about to include, supersymmetry can be preserved even 
if Wq 7^ 0.] Once can explicitly check from Eq. (9.2) that the no scale cancellation takes place 
and that hence V = Q. This is true at leading order in both the a' and the expansion, i.e. in 
particular when the compactification manifold is large compared to the string scale. 

9.3.1. Fixing the Volume l\/lodulus 

We are looking for corrections breaking the tree level no scale structure. Two known sources 
of non-perturbative corrections to this superpotential are i) so-called instanton effects and ii) 
"gaugino condensation" . The former are caused by Euclidean D3-branes wrapped on four-cycles 
in the internal manifold, and the latter are due to non-Abclian gaiigc groups that can live an 
stacks of D7-brancs also wrapping four-cycles. At special points in moduli space, these gauge 
theories undergo gaugino condensation, and one can assume to be at such a point. It can be 
shown that these two corrections enter into the superpotential in the same functional form (the 
gaugino condensate looks like a fractional instanton effect), and we treat them summarily. The 
remaining modulus p is then fixed by a non-perturbative correction to the superpotential 

W{p) = Wq + A exp(-ap) , (9.7) 

with A, a constants. Note that we take the non-perturbative correction for the superpotential 
into account while keeping the Kahler potential only at tree level. This can be consistent if the 
size of the constant term Wq is very small. 

9.3.2. The Resulting Vacuum 

We now try to understand the vacuum structure with tree level Kahler potential for p, and the 
non-perturbatively corrected superpotential. Let us also assume that the tadpole cancellation 
condition [which, in its F-theory formulation, was given in Eq. (7.24)] has been solved by fiuxes 
only and there are no localized D3-branes in the background so far. We know that in a super- 
symmetric vacuum, we have DpW = 0. Before, in Section 9.2, we saw that using fiuxes but 
without the non-perturbative corrections we just introduced in Eq. (9.7), this translates into 
Wq. Now we admit a small constant Wo, generated by a (3,0)-piece of the G(3) fiux. 
The volume modulus p is a complex variable (as it was the case in our simple example of the 
torus in Section 7.1.1), but let us set its real part, which is an axion decending from a four- form 
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gauge potential, to zero. (We shall return to the role of this axion later.) This means that we 
have only p = ia. Take the parameters A, a, Wq to be real, and let Wo be negative. Then we 
find from DpW = with Eq. (9.7) and Eq. (9.2) that 

/ 2 \ 2 a2 -2aac 

Wo = -^e-"-^ (^1 + - acTej , V = '^-^^ , (9-8) 

which means that there is an anti-de Sitter vacuum at the critical value Gc- As a consequence, p 
(or (T, respectively) now has a potential and is stabilized, while supersymmetry is still preserved. 
Note that we need ctc to be large (compared to the string scale, where a' has been set to one in 
our present units) for the supergravity expansion to be valid. 



9.3.3. Breaking Supersymmetry 

This sucessfully fixes the last of the moduli without breaking supersymmetry, and the geometry 
of the four extended dimensions is anti-de Sitter space, with negative vacuum energy density at 
(Tc, see Fig. 9.1. Therefore, this system still needs additional ingredients to break supersymmetry 
and lift the vacuum energy to zero or positive value (ie.de Sitter space). However, this must 
be achieved in a way that does not ruin the stabilization we just engineered. The KKLT idea is 
to do so by adding an anti-D3 brane, or a small number thereof. (Note that anti-branes do not 
have world volume scalar moduli because there is a preferred position where D-branes minimize 
their energy.) Let us assume that too much flux is turned on, and that therefore the tadpole 
cancellation condition (7.24) is satisfied once we add the D3 . 

An anti-brane breaks all of the supersymmetry, hence the resulting four- dimensional theory 
cannot be captured by supergravity anymore. There is then much less control over the corrections 
to the calculation. In particular, the anti-branes would normally create a runaway potential for 
the volume cr, the imaginary part of p. (Recall that for the torus discussed in Section 7.1.1, also 
the imaginary part of the Kahler modulus p was proportional to the volume oc \/det g.) But at 
present we are in a warped background, therefore the damage done by supersymmetry breaking 
can be kept small if the contribution of the antibrane to the low-energy action is parametrically 
weak. At the bottom of a warped throat, this is the case: the anti-D3 prefers to sit at the IR 
end of the geometry where it can minimize its energy. The extra energy density provided by the 
anti-D3 then is 

^3 1 

where he parameter can be timed over an extremely wide range of values because it is given 
by the compactification as an exponential of the integers labeling the fluxes of the background. 
More generally, one can write the energy contribution of an undetermined number of anti-D3 
branes in a warped throat with a given warp factor ssV oa D/a^, where D depends on the value 

of the warp factor at the position(s) of the anti-branes. 

If the parameter D is finetuncd, there now exists a de Sitter minimum for the scalar potential 
because after adding the uphfting contribution, V reads (see Fig. 9.1) 

V=^^^(-aaAe-'"' + Wo + Ae-'''']+^. (9.10) 

Note that the global minimum is still at a — t- oo, i.e. in the decompactification limit because 
all sources of energy vanish in this limit. The potential around the local de Sitter minimum is 
steep, the more so the closer one requires the value of energy density in the minimum to be to 
zero. Therefore, p has become very massive. The position of the minimum is still at large a, 



SV = 2-^^, (9.9) 
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therefore the supergravity approximation holds. It can be shown that the de Sitter minimum 
obtained in this way is metastable, but stable enough to persist for a long time compared to 
cosmological time scales of O(lO^'^yrs). 



V 

V 

0.5r 1-2 




Figure 9.1.: Left: The exemplary potential plotted in [184] to illustrate Eq. (9.8). Here, V has been 
multiplied by a factor of 10"'^^ to show the behavior close to zero. The parameters are such 
that Wq — —10^^, A — 1, a — 0.1 in Eq. (9.8). The minimum at (a large value of) <Tc is of 
AdS type. Right: The same potential (again multiplied by 10^^) after uplifting by a term 
5V (X D/a'^ with D = 3 x 10~^ in Eq. (9.10). This turns the minimum at a new (still large) 
value of (Tc into a dS one. Note that the potential is very steep around this potential and 
hence the a modulus very massive. The ctc minimum is metastable (but longlived), with 
the global minimum still being reached in the decompactification limit cr — > cx). (Figures 
from [184]) 



For a long time, progress in string cosmology was stalled because the potentials of scalar moduli 
in the four-dimensional theory were not very well understood: they were either identically zero, 
or, if lifted by non-perturbative effects, showed a runway behavior. In the KKLT setup, one 
can control a runaway direction like the overall volume modulus in the special background of 
warped compactifications, and moreover lift it to a de Sitter vacuum using anti-D3 branes. This 
is possible because the tensions of anti-D3-branes are screened while they sit at the IR end of a 
KS throat. Following this development, the number of string inflationary scenarios, in particular 
of the "brane inflation" type, grew almost without bounds. We present a small selection of them 
in the next Chapter. 
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Following the development of stabilization teehniques for both complex and Kdhler 
strueture moduli, a large variety of string inflMionary scenarios was proposed. Dif- 
ferent kinds of scalar fields were used for the infiaton, and in this Chapter we give but 
a taste of different classes. Our particular interest is in models of "brane inflation". 

A considerable number of review articles on string cosmology and string inflation exists, recent 
examples being e.g. [21, 162, 185, 9, 186, 187]. An earlier overview (prior to the KKLT construc- 
tion) is given in [188]. All types of scalar fields in string theory (of which there are, as we have 
seen, many) have been used as infiaton candidates, among them moduli of the metric and the 
extra-dimensional gauge fields (among them axions), the distance between (anti-)branes, the 
dilaton itself etc. One cannot hope to do all of them justice, and we restrict ourselves to two 
short examples of "moduli inflation" before turning to the "brane inflation" class of scenarios. A 
subset of the latter class, brane-anti-brane inflation in a warped throat geometry, stands behind 
most of the original work presented in Part III, and we return to it in detail in Chapter 11. 



10.1. The "Racetrack" Model 



Instead of stabilizing all moduli fields, one can explore whether interesting infiationary scenarios 
arise by keeping a manageable number of them dynamic in four dimensions. For example, 
consider a slight generalization of the previously introduced non-perturbative superpotential for 
a Kahler modulus (f), 

W = WQ + Ae-'"^ + Be-^'t' , (10.1) 

which is historically called the "racetrack" superpotential. The inflating direction (see Fig. ??) 
in this case is the axionic (real) part contained in the modulus p, and hence the "racetrack" 
model is an example of the axion inflation class of string cosmology [189]. In a later version, 
which can be derived rigorously from a specific Calabi Yau compactification on the manifold 
Pj^ 116 9] ("better racetrack inflation" [190]), two complex moduli fields 0i and 02 are used, the 
Kahler potential for which is 



and the non-perturbative superpotential is computed to 

W = Wq + Ae-'"t>^ + B e-^^'' 



2N3/2 



(10.2) 



(10.3) 



where A, B, a, b are parametrically calculable constants. As in the KKLT construction, the 
resulting AdS vacuum can be uplifted to a dS one by adding anti-D3-branes. The resulting 
scalar potential V then has a rather complicated form as a function of the four real fields 
7-1,2^^1,2^ where cr' = Q{<^i). This illustrates a general feature of many string inflation models: 
if the field space has several dimensions, the inflationary trajectory is often complicated, and 
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Figure 10.1.: The scalar in- 
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flaton potential 
in the race- 
track model 
of inflation, 
where the Y 
direction is the 
imaginary part 
of the Kahler 
modulus (j), 
and X the real 
part. (Figure 
from [189]) 



one cannot hold other real fields fixed while making a single real or imaginary part of a scalar 
field dynamic. Inflation is not generic in the "better racetrack" model [it is very sensitive to the 
values of A,B,a,b in the superpotential (10.3)], but can happen for specific parameter choices. 
However, it is not yet clear whether the required parameter values can be achieved from the 
particular underlying CY manifold [9]. 



In the KKLT stabilization procedure, the value of the constant superpotential Wq was very small. 
In "Kahler moduli inflation" [191, 192, 193, 194], this constant is not so severely restricted, at 
the price of perturbative corrections (in a') to the Kahler potential becoming important. This 
can produce new minima in the resulting scalar potential, but several moduli fields are required 
for this scenario. In the simplest case, three fields 4>^ are involved: while the Kahler potential 
is of the same type as in Eq. (10.2), plus a perturbative correction, the superpotential for the 
three Kahler moduli reads 



Again denoting the real parts of the scalar fields by r*, this potential can lead to inflation in 
the regime where one of them, e.g. is much larger than the other ones. Then is the most 
important direction in field space, and the potential has approximatively the form [9] 



where is the canonically normalized variable along the r"^ direction. The field values of 
must be sufficiently large to have slow roll. Compared to racetrack inflation, this potential is 
less dependent on the precise tuning of parameters. 



10.2. Kahler Moduli Inflation 




(10.4) 



Vc^Vo-C {t^)^/^ exp -c(r3)4/3 



(10.5) 



119 



Chapter 10. Exemplary Models of String Inflation 



10.3. Brane Inflation 



In the two examples above, the inflaton was associated with a modulus of the entire compact- 
ification manifold. In "brane inflation" , one instead considers world volume moduli associated 
with the positions and properties of embedded branes within the ten-dimensional string geom- 
etry. This class of models has given particularly rich offspring, and some scenarios have been 
investigated in very fine detail. (One of these is presented in the next Chapter.) Here, we 
assemble some historic developments and general features of brane inflation. 



10.3.1. Brane Inflation Prehistory 

The first string inflationary scenario using D-branes is due to Dvali and Tye in 1998 [195]: two 
parallel BPS branes (ie. they preserve half of the original supersymmetry in the background) 
do not feel any force between them, due to the cancellation of their gravitational interaction and 
the R-R electrostatic repulsion. (The dilaton $ also contributes an attractive force between the 
branes, but we will not mention it explicitly because in the scenarios we are eventually interested 
in, the dilaton is stabilized at a fixed value, i.e. gs is a constant.) To show the vanishing of the net 
force in terms of explicit string amplitudes, one would have to calculate the "cylinder" diagram 
for exchange of closed strings between the branes, see Fig. 10.2. (Note that this is the same 
diagram as for the exchange of open strings at one loop order). 

The idea presented in [195] was that after supersymmetry is broken, only the graviton would 

stay massless, and with a massive R-R mode, subject to Yukawa suppression, the cancellation of 
forces would no longer be perfect. Instead, it would be replaced by an attractive force between 
the branes. The effective potential proposed by Dvali and Tye had the form [188] 

^~2r+^('l + 5]e-Ns'--25]e-H'-) , (10.6) 



NS R 



where r is the brane distance, and d the number of compact spacetime dimensions. The first 
term in the brackets is the "cosmological constant" provided by the brane tensions, and the 
other two terms are due to the brane interactions, which cancel when the masses of the carrier 
modes are zero. If they arc massive, the cancellation is no longer exact and the potential is very 
flat, possibly suitable for inflation with r as the inflaton field. However, it is difficult to underpin 
this intuitive picture with actual calculations, and moreover, there is no natural explanation of 
the end of inflation and the transition to the reheating era. 



10.3.2. Brane-Anti-Brane Inflation 

Following the proposal of [195], the attraction between D-branes and anti-D-branes was consid- 
ered, for which the R-R and graviton interactions do not cancel, but add up, since these branes 
have opposite charge [196]. Moreover, in this case the cylinder diagram can be computed explic- 
itly. The corresponding calculation at the level of the supergravity action in type IIB theory is 
presented in the next Chapter, here let us state that the potential for the inter-brane distance 
r is found to read [188] 

nO-^-^' (10-7) 

where do is the number of dimensions orthogonal to the brane/anti-brane world volume. The 
constants A and B can be calculated explicitly in a given setup. In the case of the model 
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Figure 10.2.: Left: Closed string exchange at tree level between D-branes. The cylinder diagram 
can be exactly calculated, and describes the exchange of graviton and R-R modes 
(as well as dilaton modes) between the branes. Right: During brane-anti-brane 
inflation, the distance between the branes decreases because they attract each other 
both gravitationally and because of their opposite R-R charges. The brane distance 
r plays the role af the inflaton. Inflation ends when the branes annihilate, releasing 
the energy stored in the system. 



considered in Chapter 11, they are determined by the higher-dimensional geometric background 
into which the branes are immersed. 



"D-celeration" 

In Sections 1.2.4 and 4.2 we discussed the generalization of the standard inflationary scenario 
(with a canonical scalar field rolling down a flat potential) to modifled kinetic terms for the 
inflaton. These scenarios were dubbed "A;-inflation" in Part I, and they can be used to capture 
an interesting features of an inter-brane distance inflaton: as we shall see in the next Chapter, 
from the DBI action of a D-brane (6.3) one can show that the kinetic term of a D3-brane 
moving along one direction only is cx [1 - df,(l)d>'(j)/T{(j))]~''\ Here, T(0) denotes the position- 
dependent brane tension, and <j) is related to the radial coordinate r by a renormalization. 
For small velocities, the square root can be expanded and the inflaton kinetic term becomes 
canonical, but if (p starts to fast roll, its true DBI dynamics can lead to additional e-foldsof 
inflation even if the potential is no longer flat. This has been termed "acceleration from D- 
celeration" in [197], and a model in which (most of) the inflationary expansion is obtained in 
the fast roll regime is called "DBI inflation" [198, 199, 200]. In general, the DBI dynamics are 
present in any model where the inflation is an open string mode like an inter-brane distance, 
but it must be checked whether they in fact do play a role during inflation: for example, in 
the model investigated in Chapter 12, the DBI regime is reached more or less at the same time 
as the end of inflation, therefore the expansion, up to small corrections, occurs entirely in the 
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slow roll regime. In Chapters 13 and ??, we study models with A;-inflationary/DBI kinetic terms 
in detail with respect to their modified perturbation spectra and in the context of "stochastic 
inflation", which was introduced in Section 2.4.2. Because they make distinct predictions about 
non-Gaussianitics and the tensor-to-scalar ratio (see Chapter 13), DEI models are one way to 
distinguish string from usual field theory inflation. 



Remarks on reheating 

In our introductory remarks on cosmology in Part I, we emphasized the importance of the 

reheating era because it bridges the gap from the end of inflation to the onset of the Standard Big 
Bang Model evolution. In brane-anti-branc inflation, reheating has an intuitive interpretation 
as the annihilation between brane and anti-brane, which liberates their energy (i.e. tension), see 
Fig. 10.2 [196]. Brane annihilation sets in when the system becomes unstable because the brane 
distance is of order of the string scale: a tachyon appears in the spectrum of a string stretching 
between the brancs, making the instability manifest that leads to decay into closed string modes. 
The system's energy must then be channeled to the Standard Model degrees of freedom. 
First, all energy stored in the brane-anti-brane system is transferred into closed string modes 
produced in the annihilation. These closed strings can penetrate the bulk and travel to the 
"corner" of the CY compactification where the Standard Model particles live (see Fig. 6.3, 
e.^. on another (stack of) D-branes in a second throat. Channeling the energy into the SM 
degrees of freedom can be difficult, and their coupling to the closed string modes not very 
efficient. Multiple throats attached to the CY bulk can have different length and energy scales: 
it seems that a long throat would be favorable for faster reheating [201, 202], as it is also from 
the point of view of building a hierarchy. 



Cosmic strings 

At this point, let us briefly comment on the production of cosmic strings in the process of 
brane-anti-brane annihilation. A pair of D3-D3 that annihilates will produce Dl- and Dl- 
branes (D-strings) and Fl-strings (fundamental strings), and these can stretch to cosmological 
scales, where they appear as cosmic strings (see e.^. the reviews of [188, 186, 9] and [203]). Note 
that in type IIB theory, there are no objects such as pointlike DOs or D2 domain walls, therefore 
cosmic strings are the only type of defect produced. 

Once considered an alternative mechanism for seeding structure in the Universe, cosmic strings 
have lost this battle fair and square to the density perturbations of the inflaton, which correctly 
predict the observed coherent peak structure in the angular power spectrum. At present, the 
contribution of cosmic strings to the CMB anisotropies is bounded at the order of a few percent, 
see e.g. [204, 205, 206]. They are nevertheless interesting because they can form scaling networks, 
and would leave characteristic imprints on the CMB maps, see e.g. [207]. One can also infer an 
upper bound on their tension /i, and their number density, from observations. One advantage 
of the particular brane-anti-brane inflation setup presented in the following Chapter is that it 
"redshifts" the production rate and tension of cosmic strings [208] . It is important to keep in 
mind, however, that cosmic (super-) strings are interesting as a second way to tell string-inspired 
inflationary models apart from purely field theoretic ones. 
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Secrets of success 

Brane inflation models increasingly take over the field of string cosmology in the recent past. 
There are several reasons why this is rightfully so: brane inflation is a particularly robust 
and versatile subclass of string cosmological modes [186]. The attractive potential of Coulomb 
type between brane and anti-brane is intuitively easy to understand and technically not too 
difficult to calculate (see the following Chapter). We already mentioned that it has to be put 
into warped compactifications to be fiat enough for inflation, as we show in detail in the next 
Chapter. Fortunately, this type of compactification is favored also from the point of view of 
realistic string theory constructions. An additional mass term oc (j)^ (on top of the Coulombic 
interaction) in the inflaton's potential, while jeopardizing slow roll inflation, can nevertheless 
lead to additional e-folds in a DBI regime due to the non-canonical kinetic term of an open string 
mode inflaton like the brane distance. Hence, enough e-folds of inflation are easily achieved in 
brane inflation scenarios. The amount of parameter fine-tuning required will also be addressed 
later. 



10.4. String Inflation: User's Manual 

We end this Chapter with some guidelines for the handling of string cosmological scenarios. Note 
that, since this subject bridges the gap between two a priori very different domains of physics, 
there are different vantage points: from a theoretical point of view, one may be interested in 
the question whether a model is in the closed or open string class. As a cosmologist, one may 
choose to rather focus on the regime in which inflation occurs (i.e. standard slow roll or DBI, 
large or small range of field values). 

The importance of a given scenario's string origin can be estimated by a comparison of the Hubble 
scale H during inflation to both the string and compactification scales and rUc, respectively 
[9]. If H is 0(ms), the analysis requires the complete (and complex) machinery of string theory, 
but if the Hubble parameter is in the range rric < H < m^, one may reduce one's efforts to 
the study of a higher-dimensional field theory in which the physics of extra dimensions still is 
important. In particular, the full supergravity equations in ten dimensions must be consistently 
solved. However, if H is below the compactification scale, a strictly four-dimensional description 
of the inflationary phase suffices because the available energy is too small to make the physics of 
the extra dimensions "visible" {e.g. via excitation of their KK modes). The exemplary scenarios 
presented in this Chapter are in this last category, and it is not surprising that it can be difficult 
to tell them apart from their purely field theoretic cousins. 

Ideally, the effective Lagrangian £44 of any string-derived scenario were completely fixed by 
a data set describing the underlying string compactification. This data set should tell us the 
precise geometry and topology of the compactification manifold as well as the locations of any 
embedded D-branes and orientifold planes (as well as other sources) [185]. One would also like 
to know how many units of n-fiux are threading which geometric n-cycle in the CY manifold. Up 
to corrections in a' and gs (and backreaction effects), this compactification data would encode 
all free parameters of the four-dimensional theory. 

In today's string-inspired models, we are far from this idealized picture. Usually, a general sta- 
bilization framework is invoked for the moduli (where the precise form of the fiuxes is unknown) , 
and (anti-)branes or orientifold planes needed for tadpole cancellation are often "outsourced" to 
some corner of the overall Calabi Yau manifold, where they do not interfere with the dynamics of 
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inflation. However, since the metric on the compact CY is not known, most models are restricted 
to events inside well defined areas such as the non-compact KS throat, where calculations are 
under control. 

Corrections beyond the leading supergravity order (i.e. in a') as well as in gs are typically 
neglected. While not forbidden per se, this approach demands special care: any model which 
seems promising at leading order must be investigated for its stability against higher order 
corrections. For example, in the famous case of the KKLMMT branc inflation model, whose 
details are the subject of the next Chapter, quantum corrections lead to significant modifications 
of the original proposal. 
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Using the background material of the previous Chapters, we now discuss in de- 
tail the so-called "KKLMMT model" of brane- anti-hrane inflation, retracing each 
step from the ten- dimensional type IIB supergravity equations to the resulting four- 
dimensional inflationary cosmology. The comparison of this model's parameters to 
CMB measurements is the subject of a subsequent article (Chapter 12). We also dis- 
cuss more recent work on quantum corrections to the original KKLMMT scenario. 

An astute combination of the last Chapters' string model building tools was presented in 2003 by 
Kachru, Kallosh, Linde, Maldacena, McAllister and Trivedi [208] : in a type IIB flux compact- 
ification which stabilizes all complex structure moduli, they assumed that only the Kahler mod- 
ulus p corresponding to the total compactification volume remains unfixed. This last modulus 
is then stabilized by a non-peturbative superpotential as explained in Chapter 9, still preserv- 
ing supersymmetry. By introducing an anti-D3-brane at the bottom of the warped geometry, 
supersymmetry is mildly broken and the minimum of the potential lifted from an anti-de Sitter 
to a de Sitter one. If a light test D3-brane then is launched far away from the throat's bottom, 
it feels only a small attractive force towards the anti-D3. The distance between D3 and D3 is 
interpreted as the inflaton field, and the resulting inflationary model, called the "KKLMMT 
scenario" , has been abundantly studied in the literature. We now explain the separate steps of 
the KKLMMT construction. 



11.1. KKLMMT in a Nutshell 

We begin with a non-technical overview of this scenario: consider a D3- and an anti-D3-brane in a 
ten-dimensional type IIB supergravity background. Both branes are in the static gauge, i.e. they 
are parallel and the 3+1 dimensions of their worldvolume are aligned with the {x^ , x*), i = 
1 ... 3 of the background (which are the external, non-compact dimensions) . In the six extra, 
compactified dimensions, the brane and the anti-brane are pointlike and are separated by certain 
distances y^, A = 4 ... 9. We set all but one of these separations (the "radial" distance y^ = r) 
to zero. (The scenario can easily be turned into a multifield inflation model by making more 
than one of these distances dynamic.) It was then assumed in [208] that the geometry of the 
compact spacetime is described by the Klebanov Strassler throat discussed in Section 8.2.2. 
Into this warped but non-singular background, one then embeds one additional anti-D3 brane: 
seeking to minimize its energy, the anti-D3 will sink to the bottom of the throat (where it 
stays fixed at r = ro). Since it is an additional source of flux and curvature, its effect on the 
background is calculated as a small perturbation to the deformed conifold. A test {i.e. "light") 
D3-brane (inserted at position n) then probes this perturbed background, that is, it does not 
affect the geometry itself, but experiences forces due to gravity and Ramond-Ramond interaction 
with the anti-D3. Note that the D3 is sitting at the bottom of a KS throat with strong warping 
and therefore its tension and R-R charge (both of which attract the D3) appear redshifted when 
seen from a position up in the throat. To calculate the interaction potential between brane and 
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anti-brane, it is in fact easier to consider the former as heavy, and have the anti-brane act as a 
probe, and we will do so below. This evidently gives the same result for their mutual interaction. 
The distance r = ri — ro between the anti-D3-brane and the {N+ l)st test D3-brane corresponds 
(up to normalization) to the inflaton field 4), and its potential V{(j)) can be calculated from the 
potential V{r) experienced by the test D3-brane in the limit where r <C 4) 4 being the string 
length. Inflation therefore takes place while (j) "rolls down" the (extraordinary flat) interaction 
potential V{(f)), which corresponds to decreasing radial distance r between brane and anti-brane. 
When the branes become too close, r ^ Is, inflation ends due to the appearance of a tachyon, 
i.e. the long distance potential V{r) ceases to be a good description of the brane interaction. 

In this overview, we glossed over two essential features of the KKLMMT construction: following 
the introduction of fluxes, all of the complex structure moduli have been stabilized, but at least 
the overall volume modulus still remains unfixed. As mentioned above, one can use the non- 
perturbative KKLT stabilization procedure for the modulus p, which was discussed in Chapter 9. 
However, this turns out to be non-trivial in the presence of mobile branes. Moreover, one must 
make sure that the minimum of the potential has a positive non-zero cosmological constant, 
i.e. the AdS minimum after moduli stabilization must be uplifted to a dS minimum. We first 
derive the naive interaction potential between the branes pretending stabilization of the total 
volume does not interfere, before turning to the challenges posed by these issues. 



11.2. Brane Interaction Potential 
11.2.1. Klebanov Strassler Throat and Fluxes 

In type IIB superstring theory with three-form fluxes i?(3) and F(3) turned on, the complex 
structure moduli can be stabilized by choosing the (non-compact) Klebanov Strassler solution 
of Section 8.2.2 for the six additional dimensions. At its tip ro, the KS throat stays flnite 
because the in its basis is stabilized by the F(3) flux threading it. Let us now consider this 
in somewhat more detail. 

The definition of the deformed conifold in terms of four complex coordinates Wa = w^'^ + is 
given by Eq. (8.30). "Inside" this definition, two (Poincare dual) three-cycles A, "B are described 

by 



A 



Note that A is just the remaining finite at the tip, and the dual three-cycle S can be 
understood as the 5^ of the basis plus the radial coordinate, see Fig. 11.1. (Since the KS throat 
is attached to a compact CY manifold in its UV limit, this cycle is also finite.) It is along A, S 
that the three-form fluxes are aligned, with 

where M, K are integers, illustrating the generalized Dirac quantization condition for non- 
localized sources. Moreover, we know from the Bianchi identities of the gauge invariant self-dual 
five-form flux ^(5^ that dF(5) = iJ(3) A F^^•^ , therefore the number N of dissolved D3-branes that 
originally caused the warping is related to K and M by N = M ■ K. 

In the UV, the KS throat ends at r = i? (where it is joined into the compact Calabi Yau bulk 
manifold). Let us state again the relation between R and the background parameters given in 
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Eq. (8.11), 

i?^ = 47r c/s a'^ — , • (11-3) 

V 

where v = 16/27 for the particular case of the deformed conifold. We now want to find a similar 
expression for the IR end at tq. To this end, recall that the superpotential W on the deformed 
conifold has the form [compare Eq. (9.3)] 

W=[ Gf3) An = {27rf a' (m [ n- Kt [ n] , (11.4) 

where in the second equality, it has been used that G(3) = ii"(3) — r F(3) and that A and !B are 
Poincare dual on the internal manifold. In addition, the integrals over i?(3) and ^(3) have been 
performed, using Eq. (11.2). If we now define a complex coordinate z from the period of the 
yi-cycle 

z= n, (11.5) 
Ja 

it can be shown using so-called special geometry (see e.g. [151] for the application used here), 
that for the period on the S-cycle it holds that 

/ = log 2: + holom. , (11-6) 

where the second piece is a holomorphic function on the internal manifold. Then, the Kahler 
covariant derivative of the superpotential (11.4) with respect to the coordinate z defined by 
Eq. (11.5), in the limit where 2; <C 1 is 



D^Wc^{2Trya' — logz-i — + ... . (11.7) 
\27rz gs J 

Setting DzW = then gives an expression for the period 2 of A in terms of the flux quantum 
numbers, 

^^^-2.K/Mg._ (11.8) 

On the other hand, we also know that the deformed conifold is described by the equation (8.30) 
for the complex cooridinates Wa with the deformation parameter e^. Since at the tip, the 
(here written as the Poincare cycle A) stays finite, we conclude that = z. What does this 
tell us about the IR cutoff ro? In Section 8.2.2, we showed that oc (strictly speaking, this 
is valid in the limit where r is large enough such that the deformation is not felt yet). Then, 
we can conclude that r oc z^^^, and in exact terms we obtain for the ratio between R and ro in 
terms of the fluxes that 

Note that this can be interpreted as the (simple) warp factor evaluated at r = ro because 

h{r) oc R^/r^. We therefore now have an expression of both R and Tq in terms of background pa- 
rameters only (recall that N = K M) . This is the underlying geometry of the KKLMMT model 
before any additional (anti-)branes (apart from N branes and M fractional branes "dissolved" 
into the background) are introduced. 



11.2.2. Additional D3-Brane 

We saw earlier that adding (unwrapped) branes to the supergravity background requires two 
additional terms per Dp-brane in the action, namely the DBI part [153], 

Sbp = -Tp [ dP+^C [- det {Gab + Bab + 2Tra'Fab}Y^^ , (11.10) 

J Mr, 
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plus the Chern-Simons coupling to the C(j,^i^ gauge potential, 



(11.11) 



where Mp is the brane's {p+ l)-dimensional world volume, and Tp and fip the tension and charge, 
respectively. [The upper sign in Eq. (11.11) is for branes, the lower for anti-branes.] Gab is the 
induced metric on the brane, and Bab the pulled-back Kalb Ramond anti-symmetric tensor field. 
We set the gauge field Fab on the brane to zero. Since we work in type IIB string theory, the 
number of spatial brane dimensions p is odd, and we concentrate on the case p = 3. Then, the 
brane world volume is four-dimensional, and we align these dimensions with the external part 
M4 of spacetime. Replacing the indices (a, 6) (a, /5), the induced metric and field i?(2) are 
given by 



G. 



al3 — dn 



B. 



al3 



Br, 



dx"^ dx"' 



(11.12) 



It was shown earlier [see Eq. (8.19)] that the B^2) field only has components along the internal 
dimensions of spacetime, therefore there is no pullback onto the brane, B^p = 0. To calculate 
the induced metric, let us make the assumption that the radial coordinate yf = ri of the brane 
depends on time x^ = t (but not on the a;*). (As a motivation, recall that r will eventually 
describe the distance between the D3 and anti-D3 brane, which is the time-dependent inflaton.) 
Then we have from Eq. (11.12) that 



G 



00 



500 + 544 



dri 
~dt 



Gi 



9i: 



i = 1,2,3. 



(11.13) 



With 

gmn the ten-dimensional warped metric with the KS throat as its six-dimensional piece, 
the determinant of this induced metric is 



- det Gn 



1 



(11.14) 



The brane hence is dynamic and can move along the r direction. Using Eq. (11.10) and 
Eq. (11.11) with the upper sign, we obtain for its Lagrangian 



jCB3 = -T3h-\ri) 



dri 
~dt 



1/2 



+ f^3h (ri)«y(^) 



(11.15) 



where we have used the expansion of the square root for small dri/dt, and the fact that T3 = /i3 
in our units. We see that up to a renormalization factor of y/Ts, the radial brane coodinate ri 
behaves like a free scalar field without potential because the gravity and electrostatic (via the 
R-R charge) interactions precisely cancel. In a warped background, a D3-brane will therefore 
stay where it is put (in our case, at ri), or can be moved around without energy cost, which 
means that its world volume scalars (here ri, but potentially also the angular coordinates on 
the internal manifold) are massless moduli fields. 



Note, however, that if the D3-brane is "heavy", it backreacts on the geometry and will manifest 
itself as a small perturbation to the function h{r) of the KS throat. We describe this using 
the ansatz h = h + Sh, where Sh is the perturbation caused by the localized source. Prom the 
Einstein equations, we then obtain a differential equation for Sh, 



^^h) + - {—dh 
av^ I r \or 



C(5(r-ri), 



(11.16) 
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where C is a constant. This is precisely the same differential equation as was found for h{r) 
previously, and therefore has an analogous (localized) solution, 6h oc l/rf. The full solution 
for the perturbed warp factor h can be made plausible from physical intuition: recall that the 
unperturbed h{r) was the consequence of a stack of N D3-branes put into the background 
geometry, creating the throat. Now, we have added one more D3-brane at a fixed position ri. 
Its effect should therefore be suppressed by a factor oc 1/N. In terms of the notation in Eq. (8.7), 
we may write 

Mr) » ^ (l + ^ 9 ^ 

The constant Ci in Eq. (8.7) responsible for the plateau has again be set to zero, and we already 
know that C2 = i?^. In this way, the warp factor is equal to 1 at the edge of the throat R if the 
perturbation by the additional brane is neglected. 



11.2.3. Launching a Test Anti-Brane 

We are now ready to launch a test {i.e. "light") anti-D3-brane. Consider the case of the un- 
perturbed KS geometry first. Let us call the position where the anti-D3 is inserted y| = f 
(for all other extra coordinates we take y2 = Ui, ^ = 5 . . . 9, i.e. the brane and anti-brane are 
coincident in these dimensions). As for the D3-brane above, ro depends on time. Then, since 
the anti-brane has the same tension but opposite charge, its Lagrangian is [compare Eq. (11.15)] 



1 - h{f) 



df 
di 



'"-.3*-'««|(|)^2^. (n.is, 



where we have performed the same expansion as before. For the anti-D3, the gravitational and 
R-R interactions add up, therefore there is a potential for the dynamical coordinate f, and the 
D3 will seek to minimize its energy. If we call T{r) = T3 ^~^(f) the effective brane tension at 
the anti-branes position f, we see that for a KS warp factor h = R^/r'^, the function T(f) is 
minimized at f = tq. Therefore, unlike the D3 brane which stays fixed, the D3 will sink to the 
bottom of the throat under the influence of its DBI action and Cherm Simons term. 

Now consider a background with both D3 and D3 embedded (see Fig. 11.1), and let us switch 
their roles: the anti-D3, sitting at its minimal energy position ro, is considered heavy and 
perturbs the background warp factor as in Eq. (11.17). Because the D3 is fixed, we measure the 
position of the D3-brane relative to its location, i.e. use the distance r = ri — tq as the dynamic 
variable. The D3-brane is light and, when launched in the perturbed background, now feels the 
presence of the anti-brane attracting it. The potential for the combined D3-D3 system in the 
warped KS background can be written as 

nro,r) = ^ = 2T3 \ - 1^ (1 - 1 ^) , (11.19) 

where we have used the expansion for r ^ rg in the last term. Using the canonic renormalization 
= y/T^ri [suggested by the kinetic term in Eq. (11.15)], the brane-anti-brane system has the 
Lagrangian of a canonical scalar field, 

1 fdc^V IT^tI / rlTl 1 



For large values of ^, this potential is extremely fiat and therefore makes ^ a good candidate 
for the inflaton. Note that, because of the warping, V{<^) is fiat inside the throat (where r can 
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at most take the value R, i.e. the UV end). A common problem in earlier models of brane- 
anti-brane inflation was that the brane interaction potential was only flat enough to inflate for 
brane distances exceeding the size of the compact manifold [208]. In the KS throat, the situation 

is different: because it sits at the IR end of the geometry, the tension and charge and hence 
the attractive powers of the anti-D3 are only weakly felt up in the throat where the test D3 is 
launched. 

The derivation of the above potential relied on the exchange of massless closed string modes only 
(gravitons and R-R modes), which means that once the branes get close and other (massive) 
interaction modes are no longer Yukawa suppressed, it becomes invalid. Moreover, there is 
a critical brane distance (the string length £s) at which string with one end on each brane 
can appear. Its spectrum contains a tachyon, which signals that the brane-anti-branes system 
becomes unstable and mutual annihilation sets in. In summary, the Lagrangian (11.20) may be 
used as an effective description while i) the mobile D3-brane is inside the throat (since we do 
not know the metric of the bulk CY), r < R, and while ii) the brane distance exceeds the string 
length, r > 4- 



11.3. Issues of Volume Stabilization 

The simple Coulomb-like brane interaction that appears in the Lagrangian (11.20) is not the 
full story, and we now revisit the assumptions that went into its derivation. After fluxes have 
been turned on, all the complex structure moduli are stabilized. Moreover, in concentrating on 
the brane motion, we implicitly assumed that all Kahler moduli including the overall volume 
were also fixed. (Otherwise, the volume modulus would have a runaway potential, making it 
the direction of steepest descent in moduli space, and the flatness of the brane world volume 
modulus r could not be exploited for inflation.) We saw in Chapter 9 that the (complex) Kahler 
modulus p can be stabilized by non-perturbative effects of the superpotcntial. But does the 
KKLT procedure carry through to a background that includes a mobile D3-brane? Note that 
the other ingredient of [184], i. e. uplifting of the AdS minimum to a dS one by adding a small 
number of anti-branes is naturally incorporated in the above setup by the D3 sitting at the end 
of the throat. 

We now adress the question whether the total compactification volume in the KKLMMT is 
indeed stabilized by the non-perturbative approach used to fix p in [184]. Recall that in Chapter 
9 the real part of p was an axion (descending from the four- form potential) , while its imaginary 
part a was the total compactification volume, and the goal was to stabilize a. In the presence 
of branes, the definition of the compactification volume is more complicated. 

11.3.1. Kahler Potential 

After flux compactification, the modulus p in a background without D-brancs has the Kahler 
potential K{p,p) = — 31og[— i(p — p)], compare Eq. (9.5). However, when there are mobile 
D-branes embedded, the combined Kahler potential for p and the world volume scalars of the 
D-brancs (which we summarily denote by (f), since in the KKLMMT scenario one of them is the 
inflaton) is^ 

K{p, p, </>, ^) = -3 log [-i{p -p)- k{4>, ^)] . (11.21) 

^Note that there is a change in notation p — >• —ip between the papers [184] and [208], which is why Eq. (11.21) 
in the latter publication reads K{p, p, (f>, 0) = —3 log[p + p — k{(f), (f>)]. In particular, this exchanges real and 
imaginary parts of p. 
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Note that the Kahler potential k{<t>,(t)) for the D3 world volume scalars should just be the 
Kahler potential of the Calabi Yau compactification, since the D3 are transverse to the the 
internal manifold, and the compact CY dimensions therefore represents their possible locations. 
The real part of p is the axion arising from a four- form on the internal manifold. But we know 
that D3 branes couple to four-form potentials because their world volume is four-dimensional - 
therefore the remaining Kahler moduli space after compactification [whose Kahler potential is 
(11.21)] is not a direct product between those of p and (f). Because of the non-trivial relation 
between the axion and the brane moduli the good complex variable (in which the metric 
on moduli space takes a Kahler form) is no longer p, but a new complex quantity /?', where 
5i(/9) = 3f?(/9') (i.e. still the axion) and '^{p') is related to the proper volume modulus r by 



It is the size of relative to the string scale a' which controls the viability of the supergravity 
expansion for a background including D-branes. Give that the KKLT stabilization procedure 
addressed p, we have to consider what the fixation of p means in terms of r. 

11.3.2. Superpotential 

From the flux compactification without D-branes, we found a constant superpotential W oc 
/m6 ^(3) ^ ^' iiidependent of p. For supersymmetry to be preserved, ie. for DiW = 0, the 
imaginary self-dual three-form flux G(3) must only contain a (2,l)-piece, which gives = 0. If 
there is a contribution of the (3,0)-form, it will produce a non-zero constant, and we therefore 
write W = Wq in general. Note that with this superpotential one still obtains the no scale 
cancellation in the scalar potential F, which leads to F = 0, and p is still massless at this level. 

In [184] it was suggested to fix p by a non-perturbative correction to the constant flux super- 
potential as in Eq. (9.7). In the KKLMMT scenario, we are in a flux background that includes 
D-branes, and we want the 4> (which represent the position of the D3s on the CY manifold, 
i.e. in the above case of one D3-brane, there should be a triplet of complex (j) fields, one of which 
is the radial coordinate we studied earlier) to move freely, but the volume stabilized. Note that 
fixing the six-dimensional volume modulus is obligatory if we want to build an infiationary sce- 
nario from the above Lagrangian in Eq. (11.20): for inflation, we must have a four-dimensional 
Einstein Hilbert action in which the Planck mass is constant. Very often, this is achieved by 
truncating the KS throat at its UV end i?, because once the throat joins the bulk, calculational 
power is lost because the metric on the compact CY is unknown. (In Randall Sundrum models, 
the same effect is obtained from placing the second, "Planck" brane at a large distance.) 

11.4. The r] Problem 

To summarize, the true fleld which controls the size of the manifold and the viability of the a' 
expansion is r defined in Eq. (11.22). What are the consequences for the potential of the D-brane 
world volume scalars (f)l The difficulty is that for any positive energy configuration (which we 
need for a de Sitter minimum) using known string theory sources (such as anti-D3-branes and 
fluxes), the scalar potential will fall off as a negative power of r, i.e. schematically we can write 



2r = -{p-p)-k{ct>,4>). 



(11.22) 



[208] 




X{p) 



2X{p) 



(11.23) 



r' 
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The form of the numerator depends on the chosen source of the energy. But the non-perturbative 
KKLT stabiHzation procedure is directed towards p rather than r. Therefore, if the D3-brane 
with radial world volume modulus (j) moves, the potential changes to 



This generically yields a mass contribution of 0(1) to the inflaton potential - unless one can 

arrange for a cancellation. One option for this is to set X{p^ (j)) (i.e. the superpotential W must 
be a function not only of p, but also of (j)). Even if they are non-generic, there could then 
exist superpotentials for which the combined inflaton mass term vanishes. Note that this is not 
unlike the 77 problem we encountered earlier in the context of inflationary models derived from 
supcrgravity (see Section 3.2.3): while generically the second potential slow roll parameter r/y 
is large, inhibiting prolonged slow roll inflation, there exist special combinations of Kahler and 
superpotentials for which r/y is small due to cancellations. 

Because this is a subtle effect, let us rephrase it again: while W = Wq is a constant, a D3-brane 
put in the flux background does not feel a force, and we used this explicitly when calculating the 
brane-anti-brane interaction potential in Section 11.2. However, once the D3 starts to move (in 
the KKLMMT scenario, due to its attraction versus the anti-brane), the balancing mechanisms 
try to adjusts the force cancellation at the branc's new position [9]. As long as the Kahler 
modulus p is massless (i.e. a modulus in the proper sense), it can be varied to this effect at no 
energy cost. But at present, p has been fixed by a non-perturbative superpotential. Therefore, 
if p adjusts to the new position of the D-brane, there will be an energy penalty imposed. (One 
possible result is that the brane might be localized at a specific position in the throat.) 
There is yet another way to understand this effect. Once we cut off the throat [and therefore gain 
the right to add a four-dimensional Einstein Hilbert action to the Lagrangian (11.20)], there is in 
principle also a conformal coupling term for the brane position r, written as {T^/12)r'^R because 
it is a conformally coupled scalar [209]. Evidently, this gives a contribution to the inflaton mass, 
which is just the effect calculated above in the supergravity setup by considering the shift form 
p to r as the proper volume modulus. 

This manifestation of the r/ problem, i.e. a large inflaton mass hindering slow roll inflation on 
the flat warped brane interaction potential, was already studied in the original paper [184]. The 
main lesson learned was that inflationary model building in string theory cannot be considered 
as a separate issue form moduli stabilization (as it was often done in previous string cosmo- 
logical scenarios). However, KKLMMT argued that a modest amount of fine-tuning [0(lO~^)] 
could suffice to arrange for a cancellation of the dangerous mass term, given that there is a vast 
choice of flux compactifications. Moreover, the ^dependence of the superpotential was largely 
unknown, which inspired the hope that it could be just of the right form (i.e. lead to a term 
quadratic in (p in the four-dimensional scalar potential) to cancel the contribution calculated 
above. Then, the only remaining term in the potential would be the very weak Coulomb at- 
traction between the branes, and slow roll inflation could proceed successfully. In more recent 
work, the functional form of these superpotential corrections was obtained (see below), and 
unfortunately, this miracle does not occur. The Coulomb term is, however, unique among the 
contributions to the potential in the sense that its derivation is straightforwardly possible in 
the type IIB supergravity background studied above, and that its parameters are related in a 
transparent way to the basic string geometry of the KS throat, see Eqs. (11.3) and (11.9). 




(11.24) 
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11.5. Reheating 

The above potential between the D3-brane and the anti-D3-brane was calculated in the limit 
of large distance: the anti-D3 sits fixed at the bottom of the throat, while the D3 probe is 
launched closer to the edge of the throat and starts moving downwards. However, the branes 
come closer and closer while inflation is under way, and eventually their distance will be of the 
order of the string scale ig- It is at this point at the latest that our simplified calculation for 
the potential becomes invalid: once the branes are this close, a tachyon develops describing 
the lowest oscillatory mode of a string between the branes. This tachyon (and other degrees of 
freedom becoming massless) was is not taken into account by the Coulomb potential. 

In the cosmological picture, the appearance of the tachyon triggers the phase of reheating, 
during which the brane and the anti-branc annihilate. In this process, the energy contained in 
the tensions is freed up and can be used for heating the Standard Model degrees of freedom, so 
that the SBBM evolution can set in. However, the reheating efficiency depends largely on where 
the SM particles are to be found: the D3 and D3 we considered so far (whose distance gave 
the inflaton (f>) cannot be used to harbor the Standard Model because they disappear once their 
annihilation is complete. However, there can be an additional D3-brane (or a stack of them) 
sitting in the same throat, which remains once the original D3-anti-D3 brane have decayed. 
However, the Standard Model brane could also live in a different corner of the overall Calabi 
Yau compactification. 

The closed string loops produced by brane annihilation propagate to the Standard Model brane, 
where they couple to the gauge degrees of freedom living on it. Whether the process of reheating 
can be efficient depends largely on the nature of these couplings, and there is the danger that 
the brane tension energy could be channeled more efficiently into an unobservable sector, or into 
bulk degrees of freedom. It seems that strong warping can help to make reheating efficient, see 
Section 10.3.2 and the references given there. 



11.6. Quantum Corrections 

We saw that by embedding brane-anti-brane inflation into a warped compactification of type IIB 
string theory, one can at first sight make the brane interaction potential suitably flat for inflation. 
However, as was already discussed in the original publication [208], it would be inconsistent to 
regard brane motion on the KS geometric background while ignoring moduli stabilization (in 
particular the volume). These issues really are closely intertwined: while it is true that the 
fluxes of the warped background stabilize the complex structure moduli, the non-perturbative 
volume stabilization mechanism previously introduced in [184] is compatible with inflation only 
under additional assumptions. If the superpotential W depends on p only, as was assumed in 
[184], it will generate mass terms in the inflaton potential because the presence of D3-branes 
changes the notion of the compactification volume: p is no longer the good modulus to consider, 
but must be replaced by r. 

Given that the non-perturbative stabilization procedure was directed towards p, the true volume 
modulus r cannot be expected to stay fixed. It can be shown that the induced mass for the 
inflaton is generically of the same size as the Hubble parameter H, and therefore too large to 
support prolonged slow roll inflation. In [208] it was nevertheless argued that the (largely un- 
known) dependence of the superpotential W on (p (on top of its non-perturbative p dependence) 
could allow to cancel this mass term. While such a cancellation would not be generic, modest 
fine-tuning might suffice to make it happen. Therefore, KKLMMT argued that inflationary 
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scenarios dominated by the Coulomb-like attraction term between the branes can be feasible. 

Since the original work of [208] , the superpotential and its dependence on (p have been much bet- 
ter understood. The strategy employed is to cast all forces in the form of low energy supergravity. 
In the non-peturbative ansatz, this amounts to making A a function of (p [210, 211, 212, 213]. It 
was shown that the additional forces on the D3 can lead to a balance of forces for a range of field 
values towards the end of the throat, where inflation may then occur [214]. The potential is of 
the "inflection point" type, and the flat stretch can be engineered to be flat enough to produce 
a sufficient number of e- folds . Unlike the hypothesis made in [208] , the correction terms do not 
have the same functional form as the mass term generated by the conformal coupling. (Instead 
of being oc (f)"^, they are rather proportional to (f)^/^ [211, 212].) Therefore a cancellation can 
only occur for a limited range of (j) values even when parameters are finetuned. 

The Coulomb term we calculated above is unique in the sense that it is always present, and in a 
simple and transparent way related to the background geometry. The other corrective terms are 
much more dependent on finer details such as wrapped D7-branes descending into the throat 
along four-cycles etc. In the spirit of Section 10.4, one might say that it makes sense to use a 
less ambitious construction (i.e. the "pure" KKLMMT Coulomb term only) because complete 
compactification data (i.e. all non-perturbative corrections) is not available in any case. Put a 
different way, we choose "the devil we know" over the unknown corrections. Note that there 
is also a cosmological argument to be made for restricting the potential in this way, which will 
be illustrated by the article presented in the next Chapter: the more parameters there are in a 
string cosmological model, the lesser the hope that the link to CMB observations may tell us 
something about the underlying geometry. 
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Chapter 11. Brane Inflation in a Warped Throat 




Figure 11.1.: Left: Poincare dual 
cyles A and S in 
the deformed coni- 
fold. The (green) 
!B is closed once the 
compactified bulk 
is included. Right: 
D3-brane and anti- 
D3-brane embedded 
in the Klebanov 
Strassler throat, 
i.e. the deformed 
conifold. Fluxes 
deform the tip of the 
throat such that it is 
non-singular and ends 
at a finite value of the 
radial coordinate rg. 
At its UV end R, the 
throat is joined to the 
a compact CY bulk 
manifold. The anti- 
D3-brane sits fixed 
at the bottom of the 
geometry, while the 
test D3 is launched 
close to the edge of 
the throat. Inflation 
occurs while the D3 
moves downwards in 
the throat, attracted 
by the anti-D3-brane. 




V 
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Part III. 

Results and Publications 
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12. Contraints on Brane Inflation from 
WMAP3 



The objective of the first article published during this thesis is a clear-cut com- 
parison of the KKLMMT brane-anti-brane inflation model we studied in Chapter 11 
to the ( at the time ) most recent measurement of the cosmic microwave background 
temperature fluctuations, provided by the three-year data release of the WMAP satel- 
lite. The background of the KKLMMT model in type IIB string theory was studied 
thoroughly in this paper, and the string theoretic meaning of the "visible" cosmo- 
logical parameters established. In particular, the string coupling g^ and the string 
scale a' were not fixed a priori , but equally considered as free parameters to explore 
in the Monte Carlo Markov Chain analysis. Consistency relations derived from the 
underlying string model were imposed as priors on the parameters where applicable. 
A detailed description of the numerical methods as well as a careful interpretation of 
the obtained probability distributions is presented. 



12.1. "Brane inflation and the WMAP data: a Bayesian analysis" 
(article) 
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13. Kinetically Modified Inflation and the 
WMAP5 Data 



In the previous article on the original KKLMMT scenario, the inflaton potential 
was given by a pure Coulomb term due to the attraction between an anti-D3-brane 
(fixed at a "redshifted" position at the bottom of the Klebanov Strassler throat) and a 
mobile D3-brane. This potential is extremely flat and inflation takes place in the slow 
roll regime. However, additional terms generically appear in the potentials for string 
inflaton fields, which can render these potentials too steep for conventional slow roll. 
Accelerated expansion may still be possible because the inflaton has a non-canonical 
kinetic term. 

Inflationary scenarios with modified dynamics are commonly called "k -inflation" , and 
in this article we derive their scalar and tensor power spectra using the uniform ap- 
proximation. The non-standard dynamics for the homogeneous background field lead 
to a non-trivial sound speed for the Fourier modes of the field perturbations. The 
standard solution to the perturbation equations in terms of Hankel functions can no 
longer be used, but if the k-inflationary slow roll conditions are satisfied (i.e. when 
both the sound speed and the Hubble radius change only slowly), the equations can be 
solved with the so-called uniform approximation. In string theory, the kinetic term 
of the inflaton in brane inflation is of the Dirac Born Infeld (DBI) type, which is a 
special subclass of k -inflation. 

In a second step, the resulting spectra are compared to the five-year release of the 
WMAP satellite. It is found that in k-inflation the notion of the parameters con- 
strained by the data changes, and that therefore e.g. there is no longer an upper limit 
on the first Hubble flow parameter ei, as it is usually the case for the standard spec- 
tra. Instead, only the combination ei/7 is constrained. However, when restricted 
to the DBI subclass, this limit is recovered because in DBI scenarios the form of 
non-Gaussianities (an additional observable) is known and can be used to break the 
degeneracy between the sound speed and e\ . 



13.1. "/c-inflationary power spectra in the uniform approximation" 
(article) 

13.2. "Constraints on kinetically modified inflation from WMAP5" 
(article) 



141 



14. Tachyon Entropy Perturbations at the End 
of Brane Inflation 



A phase of brane-anti-brane inflation typically ends with the mutual annihilation of 
the branes, which corresponds to reheating of the Universe in the cosmological picture. 
The interest of the following article is to study the dynamics in the very early stages 
of reheating in more detail: a tachyon appears when the branes are sufficiently close 
to each other to start the annihilation process, and for a short period of time, there 
are two dynamical fields. Together, they can create perturbations of entropy type, and 
it is investigated under which conditions these can grow exponentially, resulting in an 
accumulated contribution to the comoving curvature (which, in one field models, is 
due to adiabatic perturbations only). It is shown that, in the absence of backreaction, 
there exist parameter values for which the part of the comoving curvature perturbation 
induced by entropy fluctuations is of the same size as the adiabatic contribution. 



14.1. "Entropy Fluctuations in Brane Inflation Models" (article) 
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15. Brane Monodromy Inflation and Reheating 



In the previous articles, the underlying string picture behind the inflationary sce- 
nario under scrutiny was a mobile D3-brane in a type IIB superstring theory back- 
ground enriched with fluxes. At present, we turn our attention to a model of brane 
inflation constructed in the dual type IIA background: a D^-brane is wrapped along 
a "monodromic" direction on the compactification manifold (which consists of two 
twisted tori). Initially, the world volume energy of the D4-brane is not minimized, 
and it will hence seek to reach its minimum by unwinding in the direction of the mon- 
odromy. This scenario is one of the rare realizations of large field inflation found in 
string theory. 

The phenomenological picture of reheating is also significantly different in mon- 
odromy inflation: in type IIA theory, the Standard Model (SM) of particle physics 
can, for example, live on a D6-brane, to which the D4 can transfer energy through 
collisions. In this article we study the consequences of such a SM D6-brane localized 
at a fixed position in the monodromic coordinate (along which the D4 is unwinding). 
The D4 will therefore hit the D6-brane repeatedly while it travels towards its world 
volume energy minimum. At each collision, the branes align, reaching a state of 
enhanced symmetry. We use a simple Lagrangian to model the interaction between 
the inflaton and the SM particles, and describe their coupling by strings between the 
branes that are created and then stretched at each brane encounter. It is, however, 
found that these strings are diluted to negligible density by the inflationary expan- 
sion, therefore no energy is transferred towards the SM when the branes meet during 
inflation. Reheating takes place instantaneously at the last brane collision after the 
D4 has completely unwound. 



15.1. "Reheating in a brane monodromy inflation model" (article) 
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Part IV. 

Conclusions 
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16. Conclusions 



At the end of this thesis we look back to our starting point, the recent efforts to 

incorporate the cosmological scenario of inflation into string theory. The progress 
of the so-called brane inflation models is m,ost striking, and several of them are 
able to match the most recent data with their predictions. We assemble our own 
contributions to the study of these models, which were presented in detail in Part 
III, and we put them in the context of both experimental and theoretical advances 
expected in the near future. 

The goal of this thesis was to understand the oiigiu and study the consequences of inflationary 
scenarios based on string theory. A natural motivation for this lies in the mutual completion 
promised by a combination of early Universe cosmology and high energy physics: phenomcnolog- 
ically successful, the scenario of inflation has yet to be put on the firm footing of an underlying 
theory. Ambitiously aimed at the unification of all physics, string theory is still in search of 
decisive signatures relevant at energy scales one can realistically hope to probe. 
Today, we arc in the fortunate situation that significant advances on both sides of the aisle 
enable us to bridge this gap: cosmological observations have become sufficiently precise to make 
quantitative statements about the very first moments of the Universe's history. String theoretic 
constructions, on the other hand, are now understood adequately enough to reduce the degrees 
of freedom during a stringy period of inflation to a manageable number and build well-defined 
scenarios. 

Our first project, presented in Chapter 12, was devoted to a detailed analysis of the KKLMMT 
brane-anti-brane inflationary scenario. Starting from its type IIB supergravity origin, we iden- 
tifled the effective cosmological parameters of this model and broke its evolution up into distinct 
regimes (quantum fluctuation dominated at very large fleld values, a slow roll phase produc- 
ing nearly all observable e-folds , a negligibly short period of DBI dynamics, and finally brane 
annihilation/reheating). We then integrated the evolution of background and perturbations dur- 
ing inflation numerically and applied a phenomenological description to the complex reheating 
phase, before propagating the primordial spectra through the SBBM evolution using the CAMB 
code. The Monte Carlo Markov Chain comparison of the KKLMMT model to the WMAP3 
data lead to probability distributions (and, in some cases, limits) for its cosmological and con- 
sequently its underlying stringy parameters. 

While the DBI phase was found to be unimportant in the "pure" KKLMMT model with a 
Coulombic potential term (from brane-anti-brane attraction) only, the modified dynamics due 
to the the open string mode character of the inflaton can be crucial for more general potentials. 
Under the name of fe-inflation, scenarios with a non-canonical kinetic term were even studied 
long before the current burst of activity in string-inspired model building. In Chapter 13, we 
present the calculation of ^-inflationary scalar and tensor perturbation spectra in the analogue 
of the slow roll limit for standard inflation: because of the modifled kinetic term, the scalar 
perturbations have a sound speed c^ ^ 1 which, to justify the slow roll limit, can only change 
slowly with Cg /Cg <C 1 (in addition to the slow variation of the Hubble parameter required in the 
standard case). A comparison of both general ^-inflation scenarios and their DBI type subclass 
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to the WMAP5 data was provided in a separate publication (Chapter 13). 
Inflation in the KKLMMT scenario ends when the D3-brane ahghts onto the anti-D3 and they 
start annihilating into closed string modes. The tell-tale sign of this process is the appearance 
of a tachyon field. In Chapter 14, we studied the phenomenology of the temporary two-field 
model (inflaton and tachyon) at work during the early stages of reheating. As is always the case 
if more than one field is dynamic during inflation, entropy perturbations on top of the standard 
adiabatic ones can develop, and their growth was analyzed under simplifying assumptions in 
Chapter 14. It was found that there exist parameter choices where the induced curvature per- 
turbation due to the entropy fluctuations is of the same size as the adiabatic contribution. 
The KKLMMT scenario is derived from type IIB superstring theory. Recently, a new model 
of brane inflation based on the dual type IIA theory was proposed, in which the inflaton is 
identified with the recurring motion of a D4-brane around a "monodromic" coordinate of the 
compact background geometry (which, in the simplest case, is a product of twisted tori). The 
striking new feature of this construction is the large range of values the infiaton can cover in 
field space, which, in the brane-anti-brane models studied so far, was always limited by the 
maximum size of the compact dimensions. Reheating in "monodromy inflation" occurs when 
the D4-brane unwinding in the monodromic direction collides with e.g. a (fixed) D6-brane on 
which the Standard Model particles are localized. It was shown in the work of Chapter 15 that, 
even though the D4 and the D6 collide multiple times during infiation (the D6 is, figuratively 
speaking, "stuck in the way" of the D4), the energy stored in the inflaton potential and the 
brane dynamics is transfcred to the Standard Model only at their last encounter after inflation 
has ended. The reheating temperature is found to be high. 

As we have seen, the stringy point of view on various building blocks of the inflationary scenario 
requires the generalization of standard tools to account, for example, for the modified kinetic 
term of the inflaton. On the other hand, the fundamentally geometric interpretation string 
theory applies to physics can, in the context of inflation, be used to justify parameter ranges 
such as the distance covered by the inflaton in field space: for example, if the inflaton is the 
separation of branes along one of the compact dimensions of the string theoretic spacetime, it 
cannot take values exceeding the total size of this dimension. 

In our work, we have tried to strengthen the link between string theory and cosmological sce- 
narios by insisting on a consistent and, as far as possible, complete translation of the former's 
concepts into the latter's observational quantities. It was shown that, in principle, string cos- 
mology makes it possible to "read off" string parameters (such as the volume of compactified 
dimensions) from precision measurements of cosmic microwave background anisotropics. In this 
undertaking we are, however, up against two intrinsic challenges: while a unique theory, it has 
been realized that string theory has a very large number of possible vacuum states, and each of 
these vacua in turn can arise from many different combinations of its parameters, all of which 
are intertwined via their geometric interpretation. At the "theory end", fixing the quantities 
entering into a given string inflation model is therefore a task that currently defies successful 
completion. Moreover, on the cosmological side, it is the built-in phenomenological robustness 
of inflation that may keep us from learning too much about its microphysical origin: inflationary 
predictions depend on few and generic observables, and the window of scales through which we 
may grasp a look at early Universe physics is restricted to a few e-folds . Therefore, distinguish- 
ing a string theoretic model from a purely field theoretic ansatz can be a delicate problem. 
Nevertheless, we can today more than ever hope to rise to this challenge. The prospect of mak- 
ing contact with observations through cosmology has reinvigorated string theorists' efforts to 
deduce a full-scale description of our Universe from its earliest moments onwards from super- 
string theory. Cosmologists have every reason to share the excitement: the Planck satellite has 
just left its launch pad in Kourou and will start collecting several years' worth of data on both 
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Chapter 16. Conclusions 



the CMB temperature fluctuations and their polarization. Thence, the decryption of intricate 
details about the era of inflation in the primordial Universe may be within our grasp before long. 




Figure 16.1.: Left: The Planck satellite mirror before packaging. Right: Herschel and Planck 
integrated in the Ariane 5 rocket, ready for take off on its launch pad. Lift off from 
the ESA basis in Kourou (French Guyana) took place as scheduled at 15:12 CEST 
on May 14th, 2009. [Pictures from the website of the European Space Agency 
(ESA)] 
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A. Remarks on Geometry and Topology 



In this Appendix, wc assemble some elements of geometry and topology used throughout Part 
II of this thesis. An introduction to these topics accessible to physicists is found in [155], and 
short summaries for string theoretic applications are included in [151] and [152, 153]. 

A.l. Real Manifolds 

Let M be a compact real d-dimensional manifold with no boundaries. On M, one may define 
p-forms ^(p), with p = 0,1, . . . ,d, from 

A^) = ^ dx"' A • • • A dx"" , (A.l) 

where "A" denotes the antisymmetrized ("wedge") tensor product. The exterior derivative d, 
when applied to a p-form, turns it into a (p + l)-form as 

dA(p) = ^ a^i^^2-/.p+i dx^' A • • • A dx'^^+i . (A.2) 

A p-form is called closed if d^(p) = 0, and it is exact if there exists a globally defined {p— l)-form 

such that ^(p) = dA(^p_iy (From d^ = 0, as can easily be shown from antisymmetry, it follows 
that an exact form is always closed.) The closed p-forms on M form the space C^(M), and the 
exact p- forms form the space 2,^(M). The pth de Rham cohomology group is defined as their 
quotient: 

:KP{M) = e*'(M)/2,^'(M) (A.3) 

Within IK^(M), two closed p- forms therefore are equivalent if their difference is an exact form. 
The dimensionality of "K^CM) gives the Betti number bp. The Euler characteristic of the manifold 
x(M) then is obtained from an alternating sum of Betti numbers, 

d 

x(M) = ^(-l)^6i(M). (A.4) 

Like the cohomology groups ;K^(M) of a manifold are defined from the action of the operator d, 
one can define the homology groups from the boundary operator 5 acting on submanifolds of 
M. (By 5Ji, we therefore mean the boundary of Ji. The sign of 5J^ accounts for its orientation.) 
Note that again 5^ = {) because the boundary of a boundary is zero. 

A p-chain Zp is a linear combination of submanifolds of dimension p. (A p-chain that has no 
boundary is closed, and a p-chain that is a boundary is called exact.) A closed chain is also 
called a cycle, and then 5zp = 0. Two p-cycles are equivalent if and only if they differ by only 
a boundary. The simplical homology group consists of equivalence classes of p-cycles and is 
denoted by "KpiM). 



152 



A. 2. Complex Manifolds 



Stokes' theorem for a real manifold M of an arbitrary number of dimensions d may be written 
as 



dA(p) = / , (A.5) 

where Tsf is an arbitrary {p+ l)-chain. This describes the so-called Poincare duality, which is an 
isomorphism between the groups "K^ and ^d-p of M. For every closed p-form and closed 
{d — p)-form B(^^_p^ , there exists a relation between the manifold M and the {d — p)-cycle N: 

/ ^(p) A 5(d-p) = / -B(d-p) (A.6) 
Jm Jj< 

(Recall that M is closed and therefore has no boundary.) Using Poincare duality, one can 
determine the Betti number by counting those p-cycles of M which are not boundaries. 

A manifold with a positive-definite metric is called a Riemannian manifold, and with a metric of 
indefinite signature it is a pseudo-Riemannian manifold. The metric allows one to calculate the 
(coordinate-independent) infinitesimal line element ds^ = gmndx^^dx^. The Laplace operator 
acting on p-forms Ap in d dimensions is given (in Euclidean signature) by 



A 



p 



dtd + ddt =(d+d^y , dt = {-ifp+d+i ^ d * . (A.7) 



(For Lorcntzian signature, there is an additional minus sign.) The Hodge or star operator * in 
Eq. (A.7) acts on p-forms and is given by 

* (dx'^i A ■ ■ ■ A dxi^-) = ^^_py_^g^y2 • • • 5m... dx--+i A ■ ■ ■ A dx'^^ (A.8) 

with e'*i "^p'^p+i 'Md the Levi Civita symbol. A p-form is harmonic if and only if Ap^(p) = 0, 
and consequently a harmonic p-form is closed and co-closed (meaning SA(^p^ = 0). The Hodge 
dual turns a closed p-form into a co-closed {d — p)-form, therefore it can be understood as an 
isomorphism between the space of harmonic p-forms and the space of harmonic (d — p)-forms, 
and for the Betti numbers it follows that bp = h^-p- 



A. 2. Complex Manifolds 



A complex manifold M of complex dimension n is defined using complex local coordinates 
z", ^" (a = 1, . . . , n). A real manifold of even dimension d = 2n is a complex manifold if the 
following conditions are satisfied: firstly, determine whether a tensor called the almost complex 
structure J^^, which satisfies Jm^J^^ = —^rrJ', exists. Secondly, check whether J"*^^ is an actual 
complex structure by calculating the so-called Nijenhuis tensor. If this tensor vanishes, J™"^ is 
a complex structure. In that case, one can choose a local complex coordinate system (z", z") in 
every open set on M such that J"*^ is given by 

J™, = i5^\ J^^ = i5^\ J^^ = X\ = . (A.9) 



Like p- forms on real manifolds, on a complex manifold one can define {p, (/)-forms with p holo- 
morphic and q anti-holomorphic indices: 

\,<i = i K,-a,l,-l, dz«i A • • • A d^«- A d#i A • • • A dz^'^ (A.IO) 
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The exterior derivative (which is real) can then be decomposed into a holomorphic and an anti- 
holomorphic piece, d = d+d, where one uses the notation d = dz" {d/dz"-) and d = dz°' {d/dz°'). 
These operators take a (p, q)-foTm to a {p+l, q)- or s, {p,q + l)-form, respectively. 

On a complex Riemannian manifold, the line element can be written in terms of local complex 
coordinates as 

ds2 = dz" dz^ + g^i dz« d# + g-a, dz» dz^ + g-^i dz^ dz^ . (A.ll) 

For the metric components, one has 5^5 = {gab)* and g^^ = {gab)* from the requirement of reality. 
On a Hermitian manifold, g^^ = gab = 0. 

By the Dolbeault cohomology group JC?'"^ (M) of a Hermitian manifold M we mean the equivalence 
classes of 5-closed (p, (7) -forms, and the dimension of Jf|'^(M) is called the Hodge number h^''^. 
The Laplacian operators for complex manifolds are 

Aq = dd^ + d^d, Ag = BB^ + B^B . (A.12) 

A Kdhler manifold is defined to be a Hermitian manifold on which the Kdhler form J is closed: 

J = igab d^" A dz^, d J = (A.13) 

The metric on these manifolds therefore satisfies dagbc = c^b^ac (and the complex conjugate), 
therefore one may write the metric as the derivative of a Kdhler potential K, 

Note that the Kahler form then is J = iddK, which means that the Kahler potential is only 
defined up to addition of arbitrary (anti-)holomorphic functions, K{z, z) = K{z, z) + f{z) + f{z). 
On Kahler manifolds, = 2Aq = 2Aq, and for the cohomology groups defind with respect to 
the operators B, 8 and d, respectively, one has 

?C|'*(M) = Oi^'^M) = OiP'i{M) . (A. 15) 

This means that the Hodge and the Betti numbers are related by bk = J2p=o h^'^~^- 

If C(j is a (p, g)-form on a Kahler manifold with n complex dimensions, then the complex conjugate 
form oj* is a (g,p)-form. Therefore the Hodge numbers of a Kahler manifold are related by 
f^P,q — j^q,p Prom the operation of the Hodge star given in Eq. (A. 8), it follows that *uj is a 
{n — p,n — g)-form and /j^^P'""? = /jP'?. 

On a Hermitian manifold, only the mixed components of the Ricci tensor are different from zero, 
and one can define the (1,1) Ricci form from 

Jl = iR^ldz'' Adz'' . (A. 16) 

The Ricci form is closed, dJl = 0, and therefore belongs to the cohomology class of 5{^'^(M), 
which is called the first Chern class ci = (l/27r) [3?] . 

There is one more property of a Riemannian manifold M (with real dimension d) we refer to 
in Section 6.2: the holonomy group !K(M) describes the way various objects transform under 
parallel transport around closed curves. The most important examples (which have so-called 
special holonomy) are C U{d/2) if M is a Kahler manifold, and C SU {d/2) if M is a Calabi 
Yau manifold. 
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